finding
active
finding:lower-more-central-emotion-pcs-are-more-persistent-than-higher-noisier-pcs-in-both-kimi-and-cogitoLower (more central) emotion PCs are more persistent than higher (noisier) PCs in both Kimi and Cogito
Rules out that persistence is an artifact of probe construction, since noise dimensions are not similarly persistent
Source paper
extracted_fromScott Sauers · Imago · Janus · Antra Tessera
Neighborhood — ranked by edge-count
Claims (1)
claim
- Rules out measurement artifact explanation for the persistence finding
Methods (1)
method
- PCA of Emotion Feature ActivationsintroducesPCA on 171 emotion probe activations across all tokens to produce ordered linear combinations and test if lower PCs are more persistent
Findings (1)
finding
- Supports that persistence is genuinely tied to emotion structure rather than measurement artifact
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Falsifiability test built into the PC analysis design
- Emotion probe persistence correlation of 0.214 in Cogito v2.1 vs 0.099 for random vectorsfinding0.790Quantifies emotion feature persistence above random baseline in Cogito across 240 multi-turn conversations
- Analysis showing that lower-rank (more central) PCs of emotion feature activations are more persistent than higher-rank (noisier) PCs
- Quantitative measure of emotion feature persistence vs random baseline in Cogito
- SAE feature emotion subspace overlap correlates with persistence in Cogito: Spearman +0.413, p=4.4e-196finding0.763Demonstrates that SAE features more aligned with the emotion subspace are more persistent in Cogito after variance control
- Demonstrates that Cogito emotion probes are persistently active beyond what is explained by their variance alone
- Strong positive relationship between emotion alignment and SAE feature persistence in Cogito
- Core empirical claim distinguishing emotion persistence from generic high-variance probe persistence
Restated by (1)
cosine ≥ 0.90Other entities that say roughly the same thing. May be merge candidates or independent restatements across papers.