Neuronpedia
Get Started
API
Releases
Jump To
Search
Models
Steer
SAE Evals
Blog/Podcast
NEW
Slack
Privacy & Terms
Contact
Sign In
© Neuronpedia 2025
Privacy & Terms
Blog/Podcast
GitHub
Slack
Twitter
Contact
Home
GPT2-Small
6
211
Prev
Next
MODEL
6
INDEX
Go
Explanations
the word 'how' in explanation
No Scores
the word 'how' can be similar to others but i insist it is.
No Scores
the word 'how'
No Scores
" How", regardless of spacing or capitalization
No Scores
"how", "what", and "understand"
No Scores
How
No Scores
phrases about the way things work, especially “how”
No Scores
phrases related to understanding or explaining concepts.
oai_token-act-pair · gpt-4-turbo
No Scores
mechanics of how things work
No Scores
Explanations
No Scores
ddddd
No Scores
New Auto-Interp
AutoInterp Type
claude-3-5-haiku-20241022
Generate
Top Features by Cosine Similarity
Embeds
Plots
Explanation
Show Test Field
Default Test Text
IFrame
<iframe src=https://www.neuronpedia.org/gpt2-small/6/211?embed=true&embedexplanation=true&embedplots=true&embedtest=true" title="Neuronpedia" style="height: 300px; width: 540px;"></iframe>
Link
https://www.neuronpedia.org/gpt2-small/6/211?embed=true&embedexplanation=true&embedplots=true&embedtest=true
Not in Any Lists
Add to List
▼
No Comments
ADD
Stacked
Snippet
Full
Show Breaks
Hide Breaks
No Known Activations