Neuronpedia
Get Started
API
Releases
Jump To
Search
Models
Steer
SAE Evals
Blog/Podcast
NEW
Slack
Privacy & Terms
Contact
Sign In
© Neuronpedia 2025
Privacy & Terms
Blog/Podcast
GitHub
Slack
Twitter
Contact
Home
GPT2-Small
6
337
Prev
Next
MODEL
6
INDEX
Go
Explanations
the phrases "of the", "is said", or "sure as"
No Scores
"the" or words where the letters "a" and "s" are adjacent (e.g., "as" and "said")
No Scores
"As" "the" "on" and variations of "say"
No Scores
grammatical article
No Scores
prepositions in idioms, often profane
No Scores
adjectives
No Scores
Words relating to our own actions when speaking informally
No Scores
Simile
No Scores
middle words in idioms
No Scores
situations where things are mentioned as true, factual, or happening.
oai_token-act-pair · gpt-4-turbo
No Scores
New Auto-Interp
AutoInterp Type
claude-3-5-haiku-20241022
Generate
Top Features by Cosine Similarity
Embeds
Plots
Explanation
Show Test Field
Default Test Text
IFrame
<iframe src=https://www.neuronpedia.org/gpt2-small/6/337?embed=true&embedexplanation=true&embedplots=true&embedtest=true" title="Neuronpedia" style="height: 300px; width: 540px;"></iframe>
Link
https://www.neuronpedia.org/gpt2-small/6/337?embed=true&embedexplanation=true&embedplots=true&embedtest=true
Not in Any Lists
Add to List
▼
No Comments
ADD
Stacked
Snippet
Full
Show Breaks
Hide Breaks
No Known Activations