Neuronpedia


© Neuronpedia 2025


GPT2-Small / 6 / Index 211

Explanations

the word 'how' in explanation

the word 'how' can be similar to others, but I insist it is.

the word 'how'

" How", regardless of spacing or capitalization

"how", "what", and "understand"

How

phrases about the way things work, especially “how”

phrases related to understanding or explaining concepts.

oai_token-act-pair · gpt-4-turbo

mechanics of how things work

