concept
active
concept:emotion-feature-persistence
emotion feature persistence
The phenomenon in which emotion feature activations remain elevated above baseline after the local token burst that triggered them has ended, measurable as long-range correlation
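As a rough illustration of what "measurable as long-range correlation" could mean in practice, the sketch below computes the sample autocorrelation of a single feature's activation series at several lags. The function names (`autocorrelation`, `persistence_profile`) and the choice of plain sample autocorrelation are assumptions made for this example, not a method taken from the paper.

```python
import numpy as np

def autocorrelation(x, lag):
    """Sample autocorrelation of a 1-D activation series at a positive lag.

    Hypothetical helper: one simple way to quantify long-range correlation.
    """
    x = np.asarray(x, dtype=float)
    if not 0 < lag < len(x):
        raise ValueError("lag must satisfy 0 < lag < len(x)")
    centered = x - x.mean()              # measure deviations from baseline (the mean)
    denom = np.dot(centered, centered)   # lag-0 term used for normalisation
    if denom == 0.0:
        return 0.0                       # constant series: no correlation defined, report 0
    return float(np.dot(centered[:-lag], centered[lag:]) / denom)

def persistence_profile(x, lags):
    """Map each lag to the autocorrelation of the series at that lag."""
    return {lag: autocorrelation(x, lag) for lag in lags}
```

Under this reading, a persistent feature would keep a clearly positive autocorrelation at lags much longer than a typical token burst, while a purely local feature would drop toward zero quickly.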
Neighborhood — ranked by edge-count
Papers (1)
paper
Concepts (4)
concept
- Anti-Persistence of Emotion Features · related_to · The phenomenon where activating an emotion feature leads to subsequent below-baseline activation of that feature
- The observed pattern of fast initial spike in emotion activation followed by slow decay to a persistent elevated baseline, characterizing emotion feature dynamics
- internal emotional state · associated_with · The possibility of a stably encoded, causally active emotional state within LLMs, as distinct from token-by-token semantic content
- Analysis showing that lower-rank (more central) PCs of emotion feature activations are more persistent than higher-rank (noisier) PCs
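The PC analysis in the last item could be set up along the following lines. This is only a sketch: it assumes principal components are obtained by an SVD of the mean-centred activation matrix and that persistence is scored as the autocorrelation of each PC's score series at a fixed lag. The name `pc_persistence` and both of those choices are hypothetical, not the paper's stated procedure.

```python
import numpy as np

def pc_persistence(activations, lag=1):
    """Autocorrelation at `lag` for each principal component's score series.

    activations: array of shape (tokens, features).
    Returns one value per PC, ordered from largest to smallest variance.
    """
    X = np.asarray(activations, dtype=float)
    if not 0 < lag < X.shape[0]:
        raise ValueError("lag must satisfy 0 < lag < number of tokens")
    X = X - X.mean(axis=0)                       # centre each feature
    # Singular values come back sorted in decreasing order, so column k of
    # U * S is the score series of the k-th principal component.
    U, S, _ = np.linalg.svd(X, full_matrices=False)
    scores = U * S
    result = []
    for k in range(scores.shape[1]):
        s = scores[:, k]                         # already zero-mean because X is centred
        denom = np.dot(s, s)
        result.append(float(np.dot(s[:-lag], s[lag:]) / denom) if denom > 0 else 0.0)
    return np.array(result)
```

If the pattern described in the item holds, the values returned for the leading components would tend to exceed those for the trailing, noisier components.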
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- The property of emotion features maintaining elevated activation well beyond the local token context that triggered them
- Core open question the paper raises but does not fully resolve
- Falsifiability test built into the PC analysis design
- Core unresolved confound the paper acknowledges but cannot rule out
- Emotion feature persistence above and beyond the persistence expected from high variance explained alone, computed by subtracting median variance-matched probe persistence
- Main conclusion about the temporal dynamics of emotion features
- Acknowledged alternative explanation that the paper does not rule out
- Interpretive hypothesis offered to explain why emotion features are more persistent
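The excess-persistence entry in the list above describes subtracting the median persistence of variance-matched probes from the emotion feature's own persistence. A minimal sketch of that arithmetic follows. The function name `excess_persistence` is hypothetical, and the probe persistence values are taken as given, since this page does not say how the variance-matched probes are selected.

```python
import numpy as np

def excess_persistence(feature_persistence, matched_probe_persistences):
    """Feature persistence minus the median persistence of variance-matched probes.

    matched_probe_persistences: persistence scores of probes assumed to explain
    a similar amount of variance as the feature (how they are matched is not
    specified here and is left to the caller).
    """
    probes = np.asarray(matched_probe_persistences, dtype=float)
    if probes.size == 0:
        raise ValueError("need at least one variance-matched probe")
    return float(feature_persistence - np.median(probes))
```

A positive result would indicate persistence beyond what high variance explained alone would predict; a result near zero would suggest the feature is no more persistent than its matched probes.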