INDEX
Explanations
instances of past and present tense verbs, particularly focusing on actions and conditions
New Auto-Interp
Negative Logits
bon
-0.16
ĥ
-0.14
زة
-0.14
istani
-0.14
ahr
-0.14
notice
-0.14
eller
-0.13
endant
-0.13
ìϏ
-0.13
Shell
-0.13
POSITIVE LOGITS
itself
0.17
gies
0.17
egra
0.16
lets
0.16
YLES
0.15
hone
0.15
ivals
0.15
letics
0.14
mac
0.14
Ñħи
0.14
Activations Density 0.416%