Neuronpedia
Get Started
API
Releases
Jump To
Search
Models
Steer
SAE Evals
Blog/Podcast
NEW
Slack
Privacy & Terms
Contact
Sign In
© Neuronpedia 2025
Privacy & Terms
Blog/Podcast
GitHub
Slack
Twitter
Contact
Home
GPT2-Small
6
644
Prev
Next
MODEL
6
INDEX
Go
Explanations
Contractions ending with “‘d be”
No Scores
phrases indicated hypotheticals (eg "wouldn't be", "it'd be", "I'd be", etc)
No Scores
Would or would not be
No Scores
the contraction ‘d or the phrase “wouldn’t be”
No Scores
contractions with "be" following
No Scores
contractions with the base verb "would."
oai_token-act-pair · gpt-4-turbo
No Scores
Statements about hypotheticals or counterfactuals
No Scores
New Auto-Interp
AutoInterp Type
claude-3-5-haiku-20241022
Generate
Top Features by Cosine Similarity
Embeds
Plots
Explanation
Show Test Field
Default Test Text
IFrame
<iframe src=https://www.neuronpedia.org/gpt2-small/6/644?embed=true&embedexplanation=true&embedplots=true&embedtest=true" title="Neuronpedia" style="height: 300px; width: 540px;"></iframe>
Link
https://www.neuronpedia.org/gpt2-small/6/644?embed=true&embedexplanation=true&embedplots=true&embedtest=true
Not in Any Lists
Add to List
▼
No Comments
ADD
Stacked
Snippet
Full
Show Breaks
Hide Breaks
No Known Activations