Neuronpedia
© Neuronpedia 2025
Gemma-2-2B-IT
gemma-2-2b-it
Google DeepMind
Releases
AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders
July 2024
pyvene.ai, The Stanford NLP Group
axbench
Features in GEMMA-2-2B-IT @ 20-axbench-reft-r1-res-16k