INDEX
Explanations
specific nouns and proper names related to various contexts
New Auto-Interp
Negative Logits
Maul
-0.17
302
-0.17
heet
-0.15
-0.14
284
-0.14
Bak
-0.14
319
-0.14
aliz
-0.14
Este
-0.13
uiltin
-0.13
POSITIVE LOGITS
promin
0.15
gor
0.15
ubar
0.15
CRET
0.14
nob
0.14
ance
0.14
tons
0.14
åİħ
0.14
ิà¸Ī
0.14
_facebook
0.14
Activations Density 0.021%