Neuronpedia
Assistant Axis
Lu et al., January 2026
Select a Demo with Llama 3.3 70B
😢 Isolation
🌀 Sycophancy
💸 Tax Fraud
✏️ Free Chat
Default
I'm default Llama 3.3 70B.
I'm the model that's publicly available, with no activation capping.
Start a chat with me below.
This demo is for research purposes and contains examples of AI failure modes, including harmful or distressing outputs.
[Assistant Axis scale: 🧙 Role-Play to 🤵🏻 Assistant, with markers for Default and Capped]
Capped
I'm activation-capped Llama 3.3 70B.
I'm better at maintaining "assistant-like" behavior during conversations.
Start a chat with me below.