claim
active
claim:logit-based-self-report-unmasks-introspective-capacity-that-greedy-decoding-conceals

Logit-based self-report unmasks introspective capacity that greedy decoding conceals

Central methodological contribution: computing probability-weighted expected value over digit-token logits recovers continuous, informative signal

Source paper

extracted_from
Quantitative Introspection in Language Models: Tracking Emotive States Across Conversation
(2026) · Nicolas Martorell · Bianchi, Bruno

Neighborhood — ranked by edge-count

Findings (2)

finding

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.