INDEX
Explanations
negative statements or denials
Negation and uncertainty markers
New Auto-Interp
Negative Logits
relatively
-0.69
only
-0.67
Relatively
-0.63
完全に
-0.62
piuttosto
-0.62
somewhat
-0.60
mostly
-0.59
plutôt
-0.58
somewhat
-0.57
fairly
-0.56
POSITIVE LOGITS
有任何
0.99
any
0.97
nicio
0.97
EVER
0.91
ever
0.87
jemals
0.86
whatsoever
0.84
ANY
0.84
alcun
0.81
Any
0.80
Activations Density 0.580%