Skip to content
Change the repository type filter

All

    Repositories list

    • Ressys benchmark code repo
      Python
      0100Updated Jul 20, 2025Jul 20, 2025
    • Official repository for Deep Research Comparator: A Platform For Fine-grained Human Annotations of Deep Research Agents [EMNLP 2025]
      Python
      0300Updated Jul 15, 2025Jul 15, 2025
    • Official repository for FactMM-RAG: Fact-Aware Multimodal Retrieval Augmentation for Accurate Medical Radiology Report Generation [NAACL 2025]
      Python
      1910Updated Jul 12, 2025Jul 12, 2025
    • AutoRule

      Public
      Official repository for AutoRule: Reasoning Chain-of-thought Extracted Rule-based Rewards Improve Preference Learning
      0410Updated Jun 18, 2025Jun 18, 2025
    • Python
      0600Updated Jun 9, 2025Jun 9, 2025
    • Python
      0010Updated May 30, 2025May 30, 2025
    • Python
      01510Updated May 21, 2025May 21, 2025
    • Organize the Web: Constructing Domains Enhances Pre-Training Data Curation
      Jupyter Notebook
      4000Updated May 2, 2025May 2, 2025
    • Python
      0100Updated Apr 2, 2025Apr 2, 2025
    • Interpret and control dense embedding via sparse autoencoder.
      Python
      0500Updated Mar 5, 2025Mar 5, 2025
    • Craw4LLM

      Public
      Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"
      Python
      5763330Updated Feb 24, 2025Feb 24, 2025
    • Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]
      Python
      44700Updated Jan 24, 2025Jan 24, 2025
    • RAGViz

      Public
      Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]
      TypeScript
      138610Updated Jan 18, 2025Jan 18, 2025
    • MATES

      Public
      Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]
      Python
      97240Updated Nov 14, 2024Nov 14, 2024
    • esae

      Public
      Python
      0000Updated Oct 29, 2024Oct 29, 2024
    • Python
      0100Updated Oct 23, 2024Oct 23, 2024
    • Python
      1800Updated Aug 23, 2024Aug 23, 2024
    • Python
      0300Updated Jun 20, 2024Jun 20, 2024