Neuronpedia
Explanation type: oai_token-act-pair
Description: OpenAI's Automated Interpretability, from the paper "Language models can explain neurons in language models", modified by Johnny Lin to add new models and context windows.
Author: OpenAI
URL: https://github.com/hijohnnylin/automated-interpretability
Settings: Default prompts from the main branch, using the TokenActivationPair strategy.
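As a rough illustration of what a token/activation-pair prompt might contain, the sketch below pairs each token with a discretized activation level. It assumes activations are clipped at zero, scaled to a 0-10 integer range by a known maximum activation, and written one tab-separated pair per line. The function names, the flooring rule, and the sample activation values are hypothetical and are not taken from this page or confirmed against the linked repository.

```python
from typing import List, Sequence

MAX_SCALE = 10  # assumed discretization range: integer levels 0..10


def normalize_activations(activations: Sequence[float], max_activation: float) -> List[int]:
    """Clip negatives to 0, scale by max_activation to 0..MAX_SCALE, and floor."""
    if max_activation <= 0:
        # No positive reference activation: treat every token as inactive.
        return [0 for _ in activations]
    return [
        min(MAX_SCALE, int(MAX_SCALE * max(a, 0.0) / max_activation))
        for a in activations
    ]


def format_token_activation_pairs(
    tokens: Sequence[str], activations: Sequence[float], max_activation: float
) -> str:
    """Render one 'token<TAB>level' line per token (hypothetical layout)."""
    if len(tokens) != len(activations):
        raise ValueError("tokens and activations must have the same length")
    levels = normalize_activations(activations, max_activation)
    return "\n".join(f"{tok}\t{lvl}" for tok, lvl in zip(tokens, levels))


if __name__ == "__main__":
    # Tokens borrowed from one of the listed examples; activations are made up.
    tokens = ["small", "and", "medium", "enterprises"]
    activations = [0.0, 1.5, 3.0, -0.2]
    print(format_token_activation_pairs(tokens, activations, max_activation=3.0))
```

An explainer model would then be shown text of this shape and asked to summarize what the high-activation tokens have in common, which is the kind of one-line explanation listed on this page.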
Recent Explanations
the beginning of a new text or document.
Explainer: gpt-4o
Tokens: Frontier || Toyota Tacoma | <bos> 2 0 0 8 Saab
Model: GEMMA-2B, Source: 6-RES-JB, Index: 8720
terms and contexts related to small and medium-sized enterprises (SMEs) or businesses.
Explainer: gpt-4-turbo
Tokens: 0 million small and medium enterprises in China , so in
Model: GEMMA-2B, Source: 6-RES-JB, Index: 5461
phrases and instances related to small and medium-sized businesses.
Explainer: gpt-4o
Tokens: 0 million small and medium enterprises in China , so in
Model: GEMMA-2B, Source: 6-RES-JB, Index: 5461
statements or discussions related to economic agreements or negotiations.
Explainer: gpt-4o-mini
Tokens: barriers . Science of the Total Environment , 4 0
Model: GEMMA-2B, Source: 6-RES-JB, Index: 3553
words related to specific locations or proper nouns, denoting particular countrysides, cities, or geographical features.
Explainer: gpt-4-turbo
Tokens: barriers . Science of the Total Environment , 4 0
Model: GEMMA-2B, Source: 6-RES-JB, Index: 3553
names of places or locations.
Explainer: gpt-4o
Tokens: barriers . Science of the Total Environment , 4 0
Model: GEMMA-2B, Source: 6-RES-JB, Index: 3553
names of people, particularly surnames or full names.
Explainer: claude-3-5-sonnet-20240620
Tokens: year - old Frank Epp erson in 1 9 0
Model: GEMMA-2-9B, Source: 17-GEMMASCOPE-RES-16K, Index: 12388
mentions of length measurements, often in inches, feet, miles, km, or metres.
Explainer: gemini-2.0-flash
Tokens: , 1 , 450 metres long , with a vertical drop
Model: GPT2-SMALL, Source: 6-RES_SCEFR-AJT, Index: 650
references to the United Kingdom, including mentions of specific UK locations, institutions, and organizations.
Explainer: claude-3-5-sonnet-20240620
Tokens: published monthly in the United Kingdom since 2 0 0
Model: GEMMA-2-9B, Source: 17-GEMMASCOPE-RES-16K, Index: 0
the word "completely" and related concepts of totality or comprehensiveness.
Explainer: claude-3-5-sonnet-20240620
Tokens: seems that literature does not completely agree on how these factors
Model: GEMMA-2-9B, Source: 0-GEMMASCOPE-RES-16K, Index: 100
sentences containing em dashes (—) or hyphens (-) separating clauses or phrases, indicating pauses or additional information.
Explainer: deepseek-v3
Tokens: a bank he ist that , marvel s a quoted police
Model: GPT2-SMALL, Source: 8-RES_FS6144-JB, Index: 4518
forms of the word "notice" or "noticed", particularly at the beginning of sentences or phrases.
Explainer: claude-3-5-sonnet-20240620
Tokens: <|endoftext|> Notice Sign – Wear Proper
Model: GPT2-SMALL, Source: 6-RES-JB, Index: 0
question marks followed by line breaks in interview-style text.
Explainer: claude-3-5-sonnet-20240620
Tokens: time Angry Birds crowd ? ↵ ↵ Ev ans : Absolutely
Model: GPT2-SMALL, Source: 8-RES-JB, Index: 14701
proper nouns or titles following the word "The".
Explainer: claude-3-5-sonnet-20240620
Tokens: young . ↵ ↵ The Next Web reports that Apple is
Model: GPT2-SMALL, Source: 8-RES-JB, Index: 18305
words or names beginning with "Oc" or "O" followed by another consonant, particularly in proper nouns or names.
Explainer: claude-3-5-sonnet-20240620
Tokens: on treatment of moderators O c asio - C ort ez
Model: GPT2-SMALL, Source: 8-RES-JB, Index: 235
financial terms related to credit ratings.
Explainer: gpt-4o-mini
Tokens: Ratings agency Standard & Poor ' s affirmed India '
Model: GEMMA-2B, Source: 6-RES-JB, Index: 471
terms and phrases related to credit ratings and financial assessments by agencies.
Explainer: gpt-4o
Tokens: Ratings agency Standard & Poor ' s affirmed India '
Model: GEMMA-2B, Source: 6-RES-JB, Index: 471
phrases starting with "family of" or "families of", particularly when followed by references to people or groups.
Explainer: claude-3-5-sonnet-20240620
Tokens: County . <|endoftext|> The family of Egyptian business ty coon Sal
Model: GPT2-SMALL, Source: 8-RES-JB, Index: 8714
numerical identifiers or codes, particularly those associated with group or state names.
Explainer: claude-3-5-sonnet-20240620
Tokens: : news : *: 16 270 : 0 : 99 999
Model: GPT2-SMALL, Source: 8-RES-JB, Index: 12173
the phrase "of" when it appears in titles or names, particularly in phrases like "Prince of", "Queen of", or "Lord of".
Explainer: claude-3-5-sonnet-20240620
Tokens: by The Thing and Prince of Darkness .[ 2 ] ↵
Model: GPT2-SMALL, Source: 8-RES-JB, Index: 1538