Skip to content
Change the repository type filter

All

    Repositories list

    • CPM.cu

      Public
      CPM.cu is a lightweight, high-performance CUDA implementation for LLMs, optimized for end-device inference and featuring cutting-edge techniques in sparse architecture, speculative sampling and quantization.
      Cuda
      1416631Updated Aug 1, 2025Aug 1, 2025
    • MiniCPM-o

      Public
      MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
      Python
      1.5k20k5715Updated Jul 31, 2025Jul 31, 2025
    • RLPR

      Public
      Extrapolating RLVR to General Domains without Verifiers
      Python
      813400Updated Jul 29, 2025Jul 29, 2025
    • An easy-to-use, fast, and easily integrable tool for evaluating audio LLM
      Python
      4128110Updated Jul 28, 2025Jul 28, 2025
    • ChatDev

      Public
      Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
      Python
      3.4k27k2224Updated Jul 22, 2025Jul 22, 2025
    • MiniCPM

      Public
      MiniCPM4: Ultra-Efficient LLMs on End Devices, achieving 5+ speedup on typical end-side chips
      Jupyter Notebook
      5058.1k140Updated Jul 8, 2025Jul 8, 2025
    • This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achieving exceptional performance on the edge.
      Python
      2426402Updated Jul 1, 2025Jul 1, 2025
    • ParamMute

      Public
      ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation
      Python
      13810Updated Jun 20, 2025Jun 20, 2025
    • AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient task execution.
      Python
      95954180Updated Jun 14, 2025Jun 14, 2025
    • [ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.
      Python
      916450Updated Jun 8, 2025Jun 8, 2025
    • sglang

      Public
      SGLang is a fast serving framework for large language models and vision language models.
      Python
      2.5k200Updated Jun 6, 2025Jun 6, 2025
    • C++
      33900Updated Jun 6, 2025Jun 6, 2025
    • OpenAct

      Public
      HTML
      0310Updated May 31, 2025May 31, 2025
    • BMTrain

      Public
      Efficient Training (including pre-training and fine-tuning) for Big Models
      Python
      8160455Updated May 29, 2025May 29, 2025
    • R1-Router

      Public
      This is the code repo for the paper "Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning".
      Python
      02200Updated May 29, 2025May 29, 2025
    • ToolBench

      Public
      [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
      Python
      4465.2k1327Updated May 21, 2025May 21, 2025
    • UltraRAG

      Public
      Build & Optimize your RAG.
      Python
      5672800Updated May 13, 2025May 13, 2025
    • RAGEval

      Public
      Python
      1318341Updated Apr 2, 2025Apr 2, 2025
    • ConsJudge

      Public
      Python
      01100Updated Mar 23, 2025Mar 23, 2025
    • An open platform for enhancing the capability of LLMs in workflow orchestration.
      Python
      2015930Updated Mar 11, 2025Mar 11, 2025
    • DEBATER

      Public
      This is the code repo for our paper "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Search".
      Python
      22311Updated Mar 2, 2025Mar 2, 2025
    • VisRAG

      Public
      Parsing-free RAG supported by VLMs
      Python
      5976210Updated Feb 19, 2025Feb 19, 2025
    • UltraLink

      Public
      An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset
      Python
      32500Updated Jan 19, 2025Jan 19, 2025
    • RepoAgent

      Public
      An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.
      Python
      107741101Updated Dec 23, 2024Dec 23, 2024
    • Python
      2800Updated Dec 17, 2024Dec 17, 2024
    • RaD-Agent

      Public
      The official implementation of the Rational Decision-Making Agent with Internalized Utility Judgment
      Python
      1500Updated Nov 12, 2024Nov 12, 2024
    • UltraEval

      Public
      [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.
      Python
      2224540Updated Oct 30, 2024Oct 30, 2024
    • Locret

      Public
      Python
      1200Updated Oct 29, 2024Oct 29, 2024
    • RAG-DDR

      Public
      This is the code repo for the paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards".
      Python
      22100Updated Oct 28, 2024Oct 28, 2024
    • IoA

      Public
      An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through internet-like connectivity.
      Python
      76745101Updated Oct 20, 2024Oct 20, 2024