Neuronpedia

APISteer SAE Evals Blog Slack Privacy & Terms Contact

© Neuronpedia 2025

Privacy & Terms Blog/RSS GitHub Slack Twitter Contact

Home
GPT2-Small
11
2175

INDEX

Explanations

'ven' in 'vene' and 'infring' in 'infringe'

words related to violation or infringement and their derivatives.

oai_token-act-pair · gpt-4-turbo

token before “e” or” “es” at the end of a word, especially “infing” in “infringes”

New Auto-Interp

Top Features by Cosine Similarity

Embeds

PlotsExplanationShow Test FieldDefault Test Text

IFrame

Link

Not in Any Lists

No Comments

No Known Activations