INDEX
Explanations
terms related to racism and social injustice
New Auto-Interp
Negative Logits
ırken
-0.51
__*/
-0.48
ásban
-0.47
Aware
-0.45
Ternyata
-0.45
ながら
-0.44
などを
-0.44
してみると
-0.43
Meanwhile
-0.42
скоро
-0.42
POSITIVE LOGITS
Period
2.16
period
2.08
period
1.93
PERIOD
1.93
Period
1.91
PERIOD
1.70
periods
1.32
periods
1.27
plain
1.25
Periods
1.22
Activations Density 0.354%