Neuronpedia

APISteer SAE Evals Blog/PodcastNEW Slack Privacy & Terms Contact

© Neuronpedia 2025

Privacy & Terms Blog/Podcast GitHub Slack Twitter Contact

Home
GPT2-Small
6
644

INDEX

Explanations

Contractions ending with “‘d be”

phrases indicated hypotheticals (eg "wouldn't be", "it'd be", "I'd be", etc)

Would or would not be

the contraction ‘d or the phrase “wouldn’t be”

contractions with "be" following

contractions with the base verb "would."

oai_token-act-pair · gpt-4-turbo

Statements about hypotheticals or counterfactuals

New Auto-Interp

Top Features by Cosine Similarity

Embeds

PlotsExplanationShow Test FieldDefault Test Text

IFrame

Link

Not in Any Lists

No Comments

No Known Activations