    Repositories list

    • attribute

      Public
      Python
      4700
      Updated Jul 5, 2025
    • Sparsify transformers with cross-layer transcoders
      Python
      MIT License
      79201
      Updated Jul 4, 2025
    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      2.5k9.5k423138
      Updated Jul 4, 2025
    • bergson

      Public
      Mapping out the "memory" of neural nets with data attribution
      Python
      MIT License
      41402
      Updated Jul 3, 2025
    • delphi

      Public
      Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models know themselves through automated interpretability.
      Python
      Apache License 2.0
      3218851
      Updated Jul 2, 2025
    • sparsify

      Public
      Sparsify transformers with SAEs and transcoders
      Python
      MIT License
      7958140
      Updated Jul 2, 2025
    • aria

      Public
      Python
      Apache License 2.0
      115600
      Updated Jul 1, 2025
    • gpt-neox

      Public
      An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
      Python
      Apache License 2.0
      1.1k7.2k6124
      Updated Jul 1, 2025
    • elk

      Public
      Keeping language models honest by directly eliciting knowledge encoded in their activations.
      Python
      MIT License
      332071510
      Updated Jun 30, 2025
    • website

      Public
      New website for EleutherAI based on Hugo static site generator
      HTML
      6512
      Updated Jun 29, 2025
    • The simplest, fastest repository for training/finetuning medium-sized GPTs.
      Python
      MIT License
      7.1k14100
      Updated Jun 27, 2025
    • Problems generated by djinn (exploitably verifiable coding problems)
      0000
      Updated Jun 27, 2025
    • djinn

      Public
      Provide a lightweight framework for authoring and validating exploitable verifiable coding problems
      Python
      0000
      Updated Jun 25, 2025
    • Linear probes with attention weighting
      Python
      1100
      Updated Jun 24, 2025
    • Python
      MIT License
      57000
      Updated Jun 13, 2025
    • cookbook

      Public
      Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
      Python
      Apache License 2.0
      4280181
      Updated Jun 9, 2025
    • pythia

      Public
      The hub for EleutherAI's work on interpretability and learning dynamics
      Jupyter Notebook
      Apache License 2.0
      1882.6k133
      Updated Jun 9, 2025
    • MIDI tokenizers and pre-processing utils.
      Python
      Apache License 2.0
      1100
      Updated Jun 5, 2025
    • Investigating goal instability in RL
      Python
      MIT License
      0100
      Updated Jun 2, 2025
    • DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
      Python
      Apache License 2.0
      4.5k16801
      Updated May 30, 2025
    • open-r1

      Public
      Fully open reproduction of DeepSeek-R1
      Python
      Apache License 2.0
      2.3k400
      Updated May 21, 2025
    • POSER

      Public
      Poser: Unmasking Alignment Faking LLMs by Manipulating Their Internals
      Python
      4200
      Updated May 21, 2025
    • tyche

      Public
      Precisely estimating the volume of basins in neural net parameter space corresponding to interpretable behaviors
      Jupyter Notebook
      Apache License 2.0
      0802
      Updated May 21, 2025
    • rtopk

      Public
      Cuda
      MIT License
      0100
      Updated May 20, 2025
    • wmdp

      Public
      WMDP is an LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method that reduces LLM performance on WMDP while retaining general capabilities.
      Jupyter Notebook
      MIT License
      34000
      Updated May 15, 2025
    • aria-amt

      Public
      Efficient and robust implementation of seq-to-seq automatic piano transcription.
      Python
      Apache License 2.0
      94700
      Updated May 5, 2025
    • fmri

      Public
      Analogue of fMRI on artificial neural networks
      MIT License
      0200
      Updated Apr 24, 2025
    • rllm

      Public
      Democratizing Reinforcement Learning for LLMs
      Jupyter Notebook
      MIT License
      333000
      Updated Apr 16, 2025
    • Ongoing research training transformer models at scale
      Python
      Other
      2.9k000
      Updated Apr 15, 2025
    • ccs

      Public
      Python
      MIT License
      6714
      Updated Mar 21, 2025