INDEX
    Explanations

    phrases related to the concept of "first" and "last" across various contexts

    New Auto-Interp
    Negative Logits
     cauza
    -0.58
    évaluateur
    -0.56
    brigens
    -0.55
     sisält
    -0.53
     lainnya
    -0.52
     elsewhere
    -0.51
     altrimenti
    -0.49
     autrement
    -0.47
     últimas
    -0.47
     Hindus
    -0.47
    POSITIVE LOGITS
    ever
    0.98
     ever
    0.93
     major
    0.85
     iteration
    0.83
     installment
    0.82
     round
    0.81
     leg
    0.80
     batch
    0.80
     step
    0.78
     big
    0.77
    Act Density 0.209%

    No Known Activations