Skip to content
Change the repository type filter

All

    Repositories list

    • sparsify

      Public
      Sparsify transformers with SAEs and transcoders
      Python
      MIT License
      6449735Updated Mar 26, 2025Mar 26, 2025
    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      2.2k8.4k357104Updated Mar 26, 2025Mar 26, 2025
    • gpt-neox

      Public
      An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
      Python
      Apache License 2.0
      1k7.1k6424Updated Mar 24, 2025Mar 24, 2025
    • elk

      Public
      Keeping language models honest by directly eliciting knowledge encoded in their activations.
      Python
      MIT License
      331971510Updated Mar 24, 2025Mar 24, 2025
    • delphi

      Public
      Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models know themselves through automated interpretability.
      Python
      Apache License 2.0
      2316523Updated Mar 24, 2025Mar 24, 2025
    • tyche

      Public
      Precisely estimating the volume of basins in neural net parameter space corresponding to interpretable behaviors
      Jupyter Notebook
      Apache License 2.0
      0500Updated Mar 23, 2025Mar 23, 2025
    • rtopk

      Public
      Cuda
      MIT License
      0100Updated Mar 22, 2025Mar 22, 2025
    • POSER

      Public
      Poser: Unmasking Alignment Faking LLMs by Manipulating Their Internals
      Python
      1000Updated Mar 21, 2025Mar 21, 2025
    • ccs

      Public
      Python
      MIT License
      6513Updated Mar 21, 2025Mar 21, 2025
    • MIT License
      0000Updated Mar 17, 2025Mar 17, 2025
    • pythia

      Public
      The hub for EleutherAI's work on interpretability and learning dynamics
      Jupyter Notebook
      Apache License 2.0
      1812.4k265Updated Mar 13, 2025Mar 13, 2025
    • DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
      Python
      Apache License 2.0
      4.3k16401Updated Mar 10, 2025Mar 10, 2025
    • cookbook

      Public
      Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
      Python
      Apache License 2.0
      4078380Updated Mar 3, 2025Mar 3, 2025
    • cupbearer

      Public
      A library for mechanistic anomaly detection
      Jupyter Notebook
      MIT License
      10600Updated Feb 26, 2025Feb 26, 2025
    • clearnets

      Public
      Python
      MIT License
      0400Updated Feb 18, 2025Feb 18, 2025
    • Closed-form polynomial approximations to neural networks
      Python
      MIT License
      01100Updated Jan 31, 2025Jan 31, 2025
    • Experiments in transformer knowledge and reasoning
      Jupyter Notebook
      MIT License
      181000Updated Jan 30, 2025Jan 30, 2025
    • A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
      Python
      Apache License 2.0
      389000Updated Jan 29, 2025Jan 29, 2025
    • Acompanying code for our research on SAE feature overlap when trained on different seeds.
      Jupyter Notebook
      Apache License 2.0
      1300Updated Jan 28, 2025Jan 28, 2025
    • mdl

      Public
      Minimum Description Length probing for neural network representations
      Python
      MIT License
      21902Updated Jan 28, 2025Jan 28, 2025
    • MIDI tokenizers and pre-processing utils.
      Python
      Apache License 2.0
      1100Updated Jan 27, 2025Jan 27, 2025
    • Erasing concepts from neural representations with provable guarantees
      Python
      MIT License
      1522622Updated Jan 27, 2025Jan 27, 2025
    • aria

      Public
      Python
      Apache License 2.0
      114400Updated Dec 24, 2024Dec 24, 2024
    • Jupyter Notebook
      MIT License
      0400Updated Dec 14, 2024Dec 14, 2024
    • website

      Public
      New website for EleutherAI based on Hugo static site generator
      HTML
      5402Updated Dec 12, 2024Dec 12, 2024
    • Jupyter Notebook
      Apache License 2.0
      22100Updated Dec 11, 2024Dec 11, 2024
    • aria-amt

      Public
      Efficient and robust implementation of seq-to-seq automatic piano transcription.
      Python
      Apache License 2.0
      83400Updated Dec 2, 2024Dec 2, 2024
    • The simplest, fastest repository for training/finetuning medium-sized GPTs.
      Python
      MIT License
      6.6k10300Updated Nov 19, 2024Nov 19, 2024
    • Jupyter Notebook
      54414Updated Nov 17, 2024Nov 17, 2024
    • monkfish

      Public
      Python
      MIT License
      1400Updated Nov 1, 2024Nov 1, 2024