Pinned

  1. OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python, 5.5k stars, 591 forks

  2. dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python, 1.2k stars, 138 forks

  3. ai2thor Public

    An open-source platform for Visual AI.

    C#, 1.4k stars, 233 forks

  4. olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python, 12.1k stars, 817 forks

  5. OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook, 725 stars, 63 forks

Repositories

Showing 10 of 498 repositories
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    Python 23 Apache-2.0 5 0 8 Updated Apr 27, 2025
  • open-instruct Public

    AllenAI's post-training codebase

    Python 2,927 Apache-2.0 376 13 14 Updated Apr 26, 2025
  • ai2thor Public

    An open-source platform for Visual AI.

    C# 1,352 Apache-2.0 233 252 4 Updated Apr 26, 2025
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    Python 200 Apache-2.0 35 1 17 Updated Apr 25, 2025
  • ai2-scholarqa-lib Public

    Open-source code for the Ai2 Scholar QA app and its corresponding library

    Python 156 Apache-2.0 26 2 0 Updated Apr 25, 2025
  • rslearn Public

    A tool for developing remote sensing datasets and models.

    Python 33 Apache-2.0 2 7 2 Updated Apr 25, 2025
  • Python 7 Apache-2.0 2 7 2 Updated Apr 25, 2025
  • olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 12,057 Apache-2.0 817 80 17 Updated Apr 25, 2025
  • OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 5,529 Apache-2.0 591 51 58 Updated Apr 25, 2025
  • beaker-py Public

    A pure-Python Beaker client

    Python 15 Apache-2.0 2 1 5 Updated Apr 24, 2025