claim
active
claim:emotion-refers-to-a-state-concept-so-stateful-representations-in-general-may-be-more-persistent-across-tokensEmotion refers to a state concept, so stateful representations in general may be more persistent across tokens.
Interpretive hypothesis offered to explain why emotion features are more persistent
Source paper
extracted_fromScott Sauers · Imago · Janus · Antra Tessera
Neighborhood — ranked by edge-count
Papers (1)
paper
Claims (2)
claim
- Proposed mechanistic explanation for why emotion features are more persistent
- Core empirical claim distinguishing emotion persistence from generic high-variance probe persistence
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Proposed explanation for why emotion probes are more persistent than variance-matched random probes
- Main conclusion about the temporal dynamics of emotion features
- Central interpretive claim of the paper supported by multiple convergent analyses
- Characterizes the temporal dynamics of emotion feature activation in LLMs
- Question raised by Anthropic and partially addressed by this paper's persistence evidence
- We hypothesize that persistently active emotional state representations exist in LLMs but may be missed by standard probing methods.hypothesis0.800Open hypothesis from the Anthropic paper that motivates this work
- Core open question the paper raises but does not fully resolve
- Falsifiability test built into the PC analysis design
Restated by (1)
cosine ≥ 0.90Other entities that say roughly the same thing. May be merge candidates or independent restatements across papers.