Neuronpedia logo - a computer chip with a rounded viewfinder border around it

    Neuronpedia

    APICircuit TracerNEWSteerSAE EvalsBlog/PodcastSlackPrivacy & TermsContact
    © Neuronpedia 2025
    Privacy & TermsBlog/PodcastGitHubSlackTwitterContact
    1. Home
    2. AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders
    AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders
    pyvene.ai, The Stanford NLP Group
    ·github.com ↗
    axbench

    Jump To

    Jump to Source/SAE
    Jump to Feature
    INDEX
    Random Feature

    Search Explanations

    Browse

    Features in GEMMA-2-9B-IT@20-axbench-reft-r1-res-16k
    1. Hover over a feature on the left to preview its details.
    2. Click a feature to lock it and interact with it.