INDEX
Explanations
expressions of agreement or dissent and the context surrounding them
Negative Logits
expandindo        -0.91
للاسماء           -0.79
uxxxx             -0.78
المناصب           -0.69
IndentedString    -0.68
promis            -0.67
AssemblyProduct   -0.65
DeleteBehavior    -0.64
ImageContext      -0.61
PyExc             -0.59
Positive Logits
disagrees         0.81
Disagree          0.80
disagree          0.77
disagreed         0.72
criticism         0.62
believes          0.61
Meinung           0.60
Disagree          0.60
disagreements     0.57
believe           0.56
Activations Density 0.425%
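
The density figure above can be read as the share of token positions on which the feature fires at all. The sketch below illustrates that reading only. It assumes density means "fraction of positions with activation above a threshold", which the page itself does not state, and the function name `activation_density`, the `threshold` parameter, and the toy data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature is active. The dashboard does not define the term.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 17 active positions out of 4000 in total.
sample = [0.0] * 3983 + [1.5] * 17
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.425% would correspond to the feature firing on roughly 4 of every 1000 token positions.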