INDEX
Explanations
programming syntax and structure specific to coding languages
Code snippets ending with specific punctuation
closing braces and return statements
New Auto-Interp
Negative Logits
[
-0.65
[
-0.60
(
-0.59
',
-0.59
'){
-0.56
"
-0.55
\[
-0.55
",
-0.53
++
-0.53
++){
-0.53
POSITIVE LOGITS
}
1.18
//}
0.97
;}
0.87
}
0.87
};
0.86
.}
0.81
↵↵↵
0.78
}.
0.77
return
0.76
}*/
0.75
Activations Density 0.108%