INDEX
    Explanations

    various forms of interviews and discussions related to current events or cultural topics

    New Auto-Interp
    Negative Logits
     Alv
    -0.15
     Til
    -0.14
     Pax
    -0.14
     Mev
    -0.14
     Interr
    -0.13
     alike
    -0.13
     Paz
    -0.13
    storybook
    -0.13
    gan
    -0.13
     Av
    -0.13
    POSITIVE LOGITS
     tiger
    0.16
    ensibly
    0.15
    abay
    0.15
    pte
    0.14
    igers
    0.14
    apur
    0.14
    ofile
    0.14
    byter
    0.14
    lear
    0.14
    ì¸ł
    0.14
    Act Density 0.177%

    No Known Activations