Skip to content
Change the repository type filter

All

    Repositories list

    • OpenSAE

      Public
      Python
      24000Updated Jan 26, 2026Jan 26, 2026
    • VerIF

      Public
      [EMNLP 2025] Verification Engineering for RL in Instruction Following
      Python
      15034Updated Jan 5, 2026Jan 5, 2026
    • Python
      1617710Updated Dec 5, 2025Dec 5, 2025
    • AgentIF

      Public
      [NIPS 2025 DB Spotlight] AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios
      Python
      12431Updated Dec 1, 2025Dec 1, 2025
    • BGPO

      Public
      Boundary-Guided Policy Optimization for Memory-Efficient RL of Diffusion Large Language Models
      Python
      0700Updated Oct 14, 2025Oct 14, 2025
    • DeepPrune

      Public
      🌿 DeepPrune: Parallel Scaling without Inter-trace Redundancy
      Python
      02000Updated Oct 10, 2025Oct 10, 2025
    • Python
      1600Updated Sep 12, 2025Sep 12, 2025
    • Linguistic-SAE

      Public archive
      Python
      1100Updated Sep 11, 2025Sep 11, 2025
    • LLMAEL

      Public
      [CIKM 2025] LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking
      Python
      11710Updated Sep 6, 2025Sep 6, 2025
    • ReaRAG

      Public
      ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation
      Python
      22500Updated Aug 24, 2025Aug 24, 2025
    • 0410Updated Jul 23, 2025Jul 23, 2025
    • RM-Bench

      Public
      [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style
      Python
      37330Updated Jul 18, 2025Jul 18, 2025
    • Python
      21510Updated Jun 25, 2025Jun 25, 2025
    • Python
      42310Updated Jun 18, 2025Jun 18, 2025
    • [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
      Python
      712400Updated Jun 11, 2025Jun 11, 2025
    • AtomR

      Public
      [KDD 2025] AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning
      Jupyter Notebook
      21400Updated May 27, 2025May 27, 2025
    • MMGeoLM

      Public
      Python
      0910Updated May 27, 2025May 27, 2025
    • Crab

      Public
      [CIKM 2025] Constraint Back-translation Improves Complex Instruction Following of Large Language Models
      Python
      01700Updated May 23, 2025May 23, 2025
    • Python
      01400Updated Apr 14, 2025Apr 14, 2025
    • [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models
      Python
      02200Updated Mar 29, 2025Mar 29, 2025
    • MRCEval

      Public
      MRCEval: A Comprehensive, Challenging and Accessible Machine Reading Comprehension Benchmark
      Python
      0400Updated Mar 12, 2025Mar 12, 2025
    • OmniEvent

      Public
      A comprehensive, unified and modular event extraction toolkit.
      Python
      39403104Updated Dec 18, 2024Dec 18, 2024
    • ADELIE

      Public
      [EMNLP2024] Aligning Large Language Models on Information Extraction
      Python
      25310Updated Nov 4, 2024Nov 4, 2024
    • KB-Plugin

      Public
      [EMNLP2024] KB-Plugin: A Plug-and-play Framework for Large Language Models to Induce Programs over Low-resourced Knowledge Bases
      Python
      1900Updated Oct 16, 2024Oct 16, 2024
    • The data and source code for the paper "MoocRadar: A Fine-grained and Multi-aspect Knowledge Repository for Improving Cognitive Student Modeling in MOOCs"
      Python
      35560Updated Oct 7, 2024Oct 7, 2024
    • DICE

      Public
      DICE: Detecting In-distribution Data Contamination with LLM's Internal State
      Python
      01100Updated Sep 21, 2024Sep 21, 2024
    • Data and code for the paper: Finding Safety Neurons in Large Language Models
      Jupyter Notebook
      01940Updated Sep 21, 2024Sep 21, 2024
    • Papers on LLM Reasoning and Retrieval-Augmented LLM Reasoning
      0800Updated Aug 27, 2024Aug 27, 2024
    • DiaKoP

      Public
      DiaKoP (CIKM Demo 2024)
      JavaScript
      0400Updated Aug 7, 2024Aug 7, 2024
    • Python
      0600Updated Jul 22, 2024Jul 22, 2024