INDEX
Explanations
references to food and culinary experiences
First-person singular pronoun "I" or "'m" or "is"
expressing thoughts and feelings
New Auto-Interp
Negative Logits
فريبيس
-0.91
TacToe
-0.89
/>";
-0.85
]-->
-0.84
/>";
-0.80
'\\;'
-0.78
OGND
-0.75
]<<"
-0.72
Ведь
-0.72
[]):
-0.70
POSITIVE LOGITS
fucking
0.81
weirdly
0.78
goddamn
0.74
apparently
0.73
fucking
0.71
FUCKING
0.68
vaguely
0.67
ostensibly
0.66
legitimately
0.65
variously
0.64
Activations Density 0.245%