INDEX
    Explanations

    first-person pronouns indicating personal experiences and thoughts

    New Auto-Interp
    Negative Logits
    ibold
    -0.15
    hir
    -0.14
     Checkout
    -0.14
    rud
    -0.14
    uld
    -0.14
    med
    -0.14
    št
    -0.14
     threshold
    -0.14
    orelease
    -0.13
    zin
    -0.13
    POSITIVE LOGITS
    gili
    0.17
    .ActionListener
    0.16
    zo
    0.15
    arth
    0.14
    \controllers
    0.14
    838
    0.14
    atos
    0.14
    zag
    0.14
    меÑĨÑĮ
    0.14
    asco
    0.13
    Act Density 0.394%

    No Known Activations