finding
active
finding:interest-probe-score-drifts-positively-across-turns-lmm-slope-0-005-p-4-12-10-14-in-llama-3-2-3b

Interest probe score drifts positively across turns: LMM slope=0.005, p=4.12×10⁻¹⁴ in LLaMA-3.2-3B

Demonstrates genuine internal-state dynamics in LLMs during multi-turn conversation

Source paper

extracted_from
Quantitative Introspection in Language Models: Tracking Emotive States Across Conversation
(2026) · Nicolas Martorell · Bianchi, Bruno

Neighborhood — ranked by edge-count

Concepts (1)

concept
  • Directions in activation space associated with contrastive emotive concept pairs studied in this paper as targets for introspection

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.