New: March 2025
    Neuronpedia is Now Open Source
    Check out the code and download 4TB of interpretability data.
    Neuronpedia is an open source interpretability platform.
    Explore, steer, and experiment on AI models.
    MIT Technology ReviewGoogle DeepMindOpenMOSS, Fudan UniversityApollo ResearchMATSEleutherAI
    Explore
    Browse over four terabytes of activations, explanations, and metadata. Neuronpedia supports probes, latents/features, custom vectors, concepts, and more.
    Steer
    Modify model behavior by steering its activations using latents or custom vectors. Steering supports instruct (chat) and reasoning models, and has fully customizable temperature, strength, seed, etc.
    Steering example with a cat feature
    Search
    Search over 50,000,000 latents/vectors, either by semantic similarity to explanation text, or by running custom text via inference through a model to find top matches.

    Search via Inference

    Run Example Search
    API + Libraries
    Neuronpedia hosts the world's first interpretability API (March 2024) - and all functionality is available by API or Python/TypeScript libraries. Most endpoints have an OpenAPI spec and interactive docs.
    Steering example with a cat feature
    Inspect
    Go in depth on each probe/latent/feature with top activations, top logits, activation density, and live inference testing. All dashboards have unique links, can be compiled into sharable lists, and supports IFrame embedding, as demonstrated here.
    Who We Are
    Neuronpedia was created by Johnny Lin, an ex-Apple engineer who previously founded a privacy startup. Neuronpedia is supported by Decode Research, the Long Term Future Fund, and AISTOF.
    Citation
    @misc{neuronpedia,
        title = {Neuronpedia: Interactive Reference and Tooling for Analyzing Neural Networks},
        year = {2023},
        note = {Software available from neuronpedia.org},
        url = {https://www.neuronpedia.org},
        author = {Lin, Johnny}
    }