claim
active
claim:persistence-is-not-an-artifact-of-probe-construction-because-lower-more-central-emotion-pcs-are-more-persistent-than-noisier-high-rank-pcsPersistence is not an artifact of probe construction because lower (more central) emotion PCs are more persistent than noisier high-rank PCs
Rules out measurement artifact explanation for the persistence finding
Source paper
extracted_fromScott Sauers · Imago · Janus · Antra Tessera
Neighborhood — ranked by edge-count
Findings (1)
finding
- Rules out that persistence is an artifact of probe construction, since noise dimensions are not similarly persistent
Methods (1)
method
- PCA on 171 emotion probe activations across all tokens to produce ordered linear combinations and test if lower PCs are more persistent
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Falsifiability test built into the PC analysis design
- Supports that persistence is genuinely tied to emotion structure rather than measurement artifact
- Core empirical claim distinguishing emotion persistence from generic high-variance probe persistence
- Acknowledged alternative explanation that the paper does not rule out
- Claims that agentic self-evaluation provides independent convergent evidence for emotion-persistence link
- Core unresolved confound the paper acknowledges but cannot rule out
- Authors' caveat that conversational context persistence rather than internal emotion state persistence could explain findings
- Analysis showing that lower-rank (more central) PCs of emotion feature activations are more persistent than higher-rank (noisier) PCs