claim
active
claim:emotion-features-are-not-strictly-locally-scoped-they-are-bursty-with-a-long-tail-of-slow-change-persisting-over-100-tokensEmotion features are not strictly locally scoped; they are bursty with a long tail of slow change persisting over 100 tokens.
Main conclusion about the temporal dynamics of emotion features
Source paper
extracted_fromScott Sauers · Imago · Janus · Antra Tessera
Neighborhood — ranked by edge-count
Papers (1)
paper
Findings (2)
finding
- Demonstrates emotion-specific persistence beyond variance effects in Cogito
- Demonstrates long-tail persistence of causal steering effect in a subset of emotion features
Hypotheses (1)
hypothesis
- Open hypothesis from the Anthropic paper that motivates this work
Claims (2)
claim
- Characterizes the temporal dynamics of emotion feature activation in LLMs
- Acknowledged alternative explanation that the paper does not rule out
Questions (1)
question
- The central research question motivating the paper
Quotes (1)
quote
- Core summary of the paper's main empirical conclusion about emotion feature dynamics
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Interpretive hypothesis offered to explain why emotion features are more persistent
- Central interpretive claim of the paper supported by multiple convergent analyses
- Proposed mechanistic explanation for why emotion features are more persistent
- Falsifiability test built into the PC analysis design
- Core empirical claim distinguishing emotion persistence from generic high-variance probe persistence
- Core open question the paper raises but does not fully resolve
- Proposed explanation for why emotion probes are more persistent than variance-matched random probes
- Question raised by Anthropic and partially addressed by this paper's persistence evidence
Restated by (1)
cosine ≥ 0.90Other entities that say roughly the same thing. May be merge candidates or independent restatements across papers.