Neuronpedia
Get Started
API
Releases
Jump To
Search
Models
Steer
SAE Evals
Blog/Podcast
NEW
Slack
Privacy & Terms
Contact
Sign In
© Neuronpedia 2025
Privacy & Terms
Blog/Podcast
GitHub
Slack
Twitter
Contact
Home
GPT2-Small
6
328
Prev
Next
MODEL
6
INDEX
Go
Explanations
"percent"
No Scores
number + percent
No Scores
number + percent, country names and surnames, personal pronouns
No Scores
percent designations
No Scores
"future past", "control", "prevention", percentages, shows (eg The Dark Night or The Deuce), et al, Irakli Garibashvili, U.S. Africa Command
No Scores
sequences of numbers and their connection to surrounding text.
oai_token-act-pair · gpt-4-turbo
No Scores
It responds to the end of a title.
No Scores
New Auto-Interp
AutoInterp Type
claude-3-5-haiku-20241022
Generate
Top Features by Cosine Similarity
Embeds
Plots
Explanation
Show Test Field
Default Test Text
IFrame
<iframe src=https://www.neuronpedia.org/gpt2-small/6/328?embed=true&embedexplanation=true&embedplots=true&embedtest=true" title="Neuronpedia" style="height: 300px; width: 540px;"></iframe>
Link
https://www.neuronpedia.org/gpt2-small/6/328?embed=true&embedexplanation=true&embedplots=true&embedtest=true
Not in Any Lists
Add to List
▼
No Comments
ADD
Stacked
Snippet
Full
Show Breaks
Hide Breaks
No Known Activations