claim

active

claim:persistence-is-not-an-artifact-of-probe-construction-because-lower-more-central-emotion-pcs-are-more-persistent-than-noisier-high-rank-pcs

Persistence is not an artifact of probe construction because lower (more central) emotion PCs are more persistent than noisier high-rank PCs

Rules out a measurement-artifact explanation for the persistence finding

Source paper

extracted_from

Persistence and Introspection of Emotion Features

Scott Sauers · Imago · Janus · Antra Tessera

Neighborhood — ranked by edge-count

Findings (1)

finding

Lower (more central) emotion PCs are more persistent than higher (noisier) PCs in both Kimi and Cogito
supports
Rules out that persistence is an artifact of probe construction, since noise dimensions are not similarly persistent

Methods (1)

method

PCA of Emotion Feature Activations
supports
PCA on the activations of 171 emotion probes across all tokens, producing ordered linear combinations, to test whether lower-rank PCs are more persistent
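
A minimal sketch of this kind of analysis, under stated assumptions rather than as the paper's implementation. The persistence measure used here (lag-1 autocorrelation of each PC's score across tokens) is a stand-in, since the paper's actual metric is not given on this page. The function names and the synthetic activations are hypothetical; only the probe count of 171 comes from the method description.

```python
import numpy as np

def pca_scores(activations):
    """Project a (tokens x probes) matrix onto its principal components.

    Returns per-token PC scores and per-PC variances, ordered from the
    highest-variance component (PC1) to the lowest.
    """
    centered = activations - activations.mean(axis=0, keepdims=True)
    # Rows of vt are PC directions, sorted by descending singular value.
    _, s, vt = np.linalg.svd(centered, full_matrices=False)
    variances = s**2 / (centered.shape[0] - 1)
    return centered @ vt.T, variances

def lag1_autocorr(series):
    """Lag-1 autocorrelation: an ASSUMED proxy for persistence."""
    x = series - series.mean()
    denom = np.dot(x, x)
    return float(np.dot(x[:-1], x[1:]) / denom) if denom > 0 else 0.0

def persistence_by_pc(activations):
    """Persistence of every PC, in PC-rank order, plus the PC variances."""
    scores, variances = pca_scores(activations)
    persistence = np.array(
        [lag1_autocorr(scores[:, k]) for k in range(scores.shape[1])]
    )
    return persistence, variances

# Synthetic stand-in data (NOT from the paper): one slowly varying latent
# signal shared across probes, plus independent per-token noise.
rng = np.random.default_rng(0)
n_tokens, n_probes = 2000, 171
latent = np.zeros(n_tokens)
for t in range(1, n_tokens):
    latent[t] = 0.95 * latent[t - 1] + rng.normal()
loadings = rng.normal(size=n_probes)
activations = np.outer(latent, loadings) + rng.normal(size=(n_tokens, n_probes))

persistence, variances = persistence_by_pc(activations)
print("PC1 persistence:", round(persistence[0], 3))
print("mean persistence of last 50 PCs:", round(persistence[-50:].mean(), 3))
```

In this toy setup the shared slow signal dominates PC1, so PC1 scores change slowly from token to token while the trailing PCs behave like white noise. That mirrors the pattern the method is designed to detect. It says nothing about the real models' activations.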

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

If persistence is genuinely related to emotion features, lower PCs of the emotion space (more central, less noisy) should be more persistent; if it is an artifact, noisier PCs should have similar persistence. · hypothesis · 0.900
Falsifiability test built into the PC analysis design
Lower (more central) PCs of emotion feature activations are more persistent than higher-rank (noisier) PCs in both Kimi and Cogito, above variance-matched baselines. · finding · 0.835
Supports that persistence is genuinely tied to emotion structure rather than measurement artifact
Emotion probes are more persistent than variance-matched random probes, indicating emotion-specific persistence beyond autoregressive dynamics. · claim · 0.830
Core empirical claim distinguishing emotion persistence from generic high-variance probe persistence
Persistent conversational context that produced emotion-relevant activation is a plausible driver for the observed persistence results. · claim · 0.816
Acknowledged alternative explanation that the paper does not rule out
The relationship between persistence and self-evaluated emotionality serves as a replication of probe-based findings without shared confounds from probe construction. · claim · 0.816
Claims that agentic self-evaluation provides independent convergent evidence for emotion-persistence link
Whether observed persistence reflects a genuine lingering emotion-like state or merely persistent conversational context that produced the emotion-relevant activation · question · 0.815
Core unresolved confound the paper acknowledges but cannot rule out
Persistent conversational context that produced emotion-relevant activations is a plausible driver of observed persistence results. · claim · 0.812
Authors' caveat that conversational context persistence rather than internal emotion state persistence could explain findings
PCs of the emotion space and persistence · concept · 0.809
Analysis showing that lower-rank (more central) PCs of emotion feature activations are more persistent than higher-rank (noisier) PCs