INDEX
Explanations
comparisons between different entities or concepts
comparative phrases involving the word "versus."
New Auto-Interp
Negative Logits
olog
-0.83
unes
-0.81
shire
-0.79
obal
-0.77
seed
-0.76
lied
-0.75
ocrine
-0.74
omical
-0.74
ERN
-0.72
Dise
-0.71
POSITIVE LOGITS
hill
0.73
pecting
0.66
nil
0.65
mindset
0.63
LCD
0.62
await
0.60
expecting
0.60
bureaucratic
0.59
trusting
0.59
ugly
0.58
Activations Density 0.015%