Skip to content
Change the repository type filter

All

    Repositories list

    • evalchemy

      Public
      Automatic evals for LLMs
      HTML
      574621312Updated Jun 27, 2025Jun 27, 2025
    • HTML
      2600Updated Jun 15, 2025Jun 15, 2025
    • open_clip

      Public
      An open source implementation of CLIP.
      Python
      Other
      1.1k12k4833Updated Jun 10, 2025Jun 10, 2025
    • open_lm

      Public
      A repository for research on medium sized language models.
      Python
      MIT License
      715033435Updated Jun 6, 2025Jun 6, 2025
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      Apache License 2.0
      30k000Updated May 18, 2025May 18, 2025
    • datacomp

      Public
      DataComp: In search of the next generation of multimodal datasets
      Python
      Other
      60722273Updated Apr 28, 2025Apr 28, 2025
    • dclm

      Public
      DataComp for Language Models
      HTML
      MIT License
      1221.3k152Updated Mar 19, 2025Mar 19, 2025
    • rtfm

      Public
      Research on Tabular Foundation Models
      Python
      MIT License
      1252120Updated Dec 13, 2024Dec 13, 2024
    • MixEval

      Public
      The official evaluation suite and dynamic data release for MixEval.
      Python
      41000Updated Sep 20, 2024Sep 20, 2024
    • An open-source framework for training large multimodal models.
      Python
      MIT License
      3064k455Updated Aug 31, 2024Aug 31, 2024
    • tabliblib

      Public
      A Python library for processing and filtering TabLib
      Python
      MIT License
      31100Updated Aug 24, 2024Aug 24, 2024
    • MINT-1T

      Public
      MINT-1T: A one trillion token multimodal interleaved dataset.
      1981910Updated Jul 31, 2024Jul 31, 2024
    • Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
      Python
      MIT License
      4647380Updated Jul 15, 2024Jul 15, 2024
    • A benchmark for distribution shift in tabular data
      Python
      MIT License
      1455121Updated Jun 6, 2024Jun 6, 2024
    • scaling

      Public
      Language models scale reliably with over-training and on downstream tasks
      Jupyter Notebook
      MIT License
      59720Updated Apr 2, 2024Apr 2, 2024
    • Python
      MIT License
      72600Updated Mar 21, 2024Mar 21, 2024
    • Editing Models with Task Arithmetic
      Python
      4148290Updated Jan 11, 2024Jan 11, 2024
    • Python
      25000Updated Oct 29, 2023Oct 29, 2023
    • patching

      Public
      Patching open-vocabulary models by interpolating weights
      Python
      MIT License
      89100Updated Sep 28, 2023Sep 28, 2023
    • Python
      2200Updated Aug 22, 2023Aug 22, 2023
    • LLM training code for MosaicML foundation models
      Python
      Apache License 2.0
      571100Updated Aug 10, 2023Aug 10, 2023
    • CSS
      MIT License
      0300Updated Jun 2, 2023Jun 2, 2023
    • Simple large-scale training of stable diffusion with multi-node support.
      Python
      913320Updated May 8, 2023May 8, 2023
    • Efficiently process webdatasets
      Python
      0410Updated Apr 5, 2023Apr 5, 2023
    • Release of ImageNet-Captions
      MIT License
      55000Updated Jan 20, 2023Jan 20, 2023
    • 0000Updated Jan 17, 2023Jan 17, 2023
    • Jupyter Notebook
      MIT License
      4710Updated Nov 3, 2022Nov 3, 2022
    • Python
      12900Updated Oct 18, 2022Oct 18, 2022
    • wise-ft

      Public
      Robust fine-tuning of zero-shot models
      Python
      Other
      7472190Updated Apr 29, 2022Apr 29, 2022
    • au21

      Public
      Jupyter Notebook
      MIT License
      0100Updated Nov 8, 2021Nov 8, 2021