INDEX
Explanations
phrases indicating a comparison or contrast
instances of the word "that" indicating referencing or elaboration
New Auto-Interp
Negative Logits
Jam
-0.80
Bow
-0.79
Corpus
-0.73
CBC
-0.72
Pri
-0.72
MAP
-0.69
christ
-0.68
kay
-0.67
hip
-0.67
Gy
-0.66
POSITIVE LOGITS
haun
0.81
yip
0.78
transc
0.74
consumes
0.74
ivia
0.71
oker
0.71
perme
0.70
ItemTracker
0.70
resolves
0.68
warts
0.68
Activations Density 0.075%