INDEX
Explanations
mentions of personal actions or choices
phrases related to personal agency and self-determination
New Auto-Interp
Negative Logits
Expend
-0.62
antine
-0.57
Mankind
-0.57
ql
-0.56
MSN
-0.55
20439
-0.54
address
-0.54
quotations
-0.53
Liberia
-0.53
Atk
-0.52
POSITIVE LOGITS
way
0.92
hardest
0.75
uchs
0.70
owitz
0.69
atically
0.65
uously
0.64
yond
0.63
orously
0.63
heit
0.63
heartbeat
0.63
Activations Density 0.340%