Neuronpedia

APISteer SAE Evals Blog/PodcastNEW Slack Privacy & Terms Contact

© Neuronpedia 2025

Privacy & Terms Blog/Podcast GitHub Slack Twitter Contact

Home
GPT2-Small
6
765

INDEX

Explanations

factoring in or together

Math verbs like add or multiplying

use of simple mathematical terms in common language, like adding and subtracting

terms and phrases related to mathematics, e.g. operations (factor, add, subtract, etc.)

Math instructions in word problems

numbers, related calculations, and words associated with comparison.

oai_token-act-pair · gpt-4-turbo

New Auto-Interp

Top Features by Cosine Similarity

Embeds

PlotsExplanationShow Test FieldDefault Test Text

IFrame

Link

Not in Any Lists

No Comments

No Known Activations