INDEX
Explanations
complex arguments and perspectives in academic discourse
Negative Logits
elucid       -0.20
Narr         -0.15
urv          -0.14
oplay        -0.14
Narrative    -0.14
Trot         -0.14
Talks        -0.14
Lists        -0.14
ottie        -0.14
åĢ«          -0.13
Positive Logits
exam           0.33
examine        0.31
examines       0.29
examination    0.29
explores       0.27
explore        0.27
examined       0.26
chart          0.25
Examination    0.25
consideration  0.24
Activations Density: 0.118%