INDEX
    Explanations

    phrases indicating a comparison or contrast

    instances of the word "that" indicating referencing or elaboration

    New Auto-Interp
    Negative Logits
    Jam
    -0.80
    Bow
    -0.79
     Corpus
    -0.73
    CBC
    -0.72
    Pri
    -0.72
    MAP
    -0.69
    christ
    -0.68
    kay
    -0.67
    hip
    -0.67
    Gy
    -0.66
    POSITIVE LOGITS
     haun
    0.81
    yip
    0.78
     transc
    0.74
     consumes
    0.74
    ivia
    0.71
    oker
    0.71
     perme
    0.70
    ItemTracker
    0.70
     resolves
    0.68
    warts
    0.68
    Act Density 0.075%

    No Known Activations