INDEX
Explanations
details related to physical displacement or movement in a spatial context
New Auto-Interp
Negative Logits
ringe
-0.16
olson
-0.15
utz
-0.15
enna
-0.14
iveau
-0.14
ommen
-0.14
itur
-0.13
ieme
-0.13
Ĥæķ°
-0.13
ibold
-0.13
POSITIVE LOGITS
opposite
0.22
direction
0.18
Directions
0.18
egative
0.17
positive
0.16
lug
0.16
Positive
0.16
polarity
0.16
directions
0.16
æĸ¹åIJij
0.16
Activations Density 0.217%