question
active
question:when-self-report-changes-significantly-while-a-linear-probe-stays-flat-is-the-probe-misspecified-or-the-self-report-spurious

When self-report changes significantly while a linear probe stays flat, is the probe misspecified or the self-report spurious?

Key interpretive question the framework helps address through convergent validation logic

Source paper

extracted_from
Quantitative Introspection in Language Models: Tracking Emotive States Across Conversation
(2026) · Nicolas Martorell · Bianchi, Bruno

Neighborhood — ranked by edge-count

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.