INDEX
Explanations
discussions about extremes and balance
New Auto-Interp
Negative Logits
imas
-0.17
ceil
-0.16
olas
-0.16
vault
-0.15
oola
-0.15
Giang
-0.15
ÏģοÏħ
-0.14
inh
-0.14
elight
-0.14
engin
-0.13
POSITIVE LOGITS
intermediate
0.54
middle
0.47
Intermediate
0.45
Intermediate
0.42
middle
0.40
intermediary
0.40
intermedi
0.37
somewhere
0.37
midd
0.37
between
0.36
Activations Density 0.206%