finding
active
In Cogito v2.1, average residual persistence above variance-matched probes is +0.077 (p = 1.5e-27, 157 of 171 probes positive).
Demonstrates emotion-specific persistence beyond variance effects in Cogito
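As a rough sanity check on the "157 of 171 probes positive" count, the sketch below computes a one-sided sign-test p-value from that count alone. This is an assumption-laden illustration: the source does not say which test produced p = 1.5e-27, so the sign test here is a swapped-in technique and its value is not expected to match the reported one. The function name `sign_test_p` is hypothetical.

```python
from math import comb


def sign_test_p(positives: int, n: int) -> float:
    """One-sided sign-test p-value: P(X >= positives) for X ~ Binomial(n, 0.5).

    Null hypothesis (an assumption of this sketch): each probe's residual
    persistence is equally likely to be positive or negative.
    """
    # Exact integer tail count, divided once at the end to avoid float drift.
    tail = sum(comb(n, k) for k in range(positives, n + 1))
    return tail / 2 ** n


# Counts taken from the finding: 157 of 171 probes positive.
p_sign = sign_test_p(157, 171)
print(f"positive fraction: {157 / 171:.3f}")
print(f"one-sided sign-test p: {p_sign:.2e}")
```

The sign test ignores effect magnitudes, so it only shows that the direction of the effect is very unlikely under a coin-flip null; it says nothing about the +0.077 average itself.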
Source paper
extracted_from: Scott Sauers · Imago · Janus · Antra Tessera
Neighborhood — ranked by edge-count
Claims (2)
claim
- Emotion probes are more persistent than variance-matched random probes, indicating emotion-specific persistence beyond autoregressive dynamics. [associated_with · supports] Core empirical claim distinguishing emotion persistence from generic high-variance probe persistence
- Main conclusion about the temporal dynamics of emotion features
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; candidates for new edges or unrecognized duplicates.
- Demonstrates that Cogito emotion probes are persistently active beyond what is explained by their variance alone
- Emotion probe persistence correlation of 0.214 in Cogito v2.1 vs 0.099 for random vectors [finding · 0.842] Quantifies emotion feature persistence above random baseline in Cogito across 240 multi-turn conversations
- Quantitative measure of emotion feature persistence vs random baseline in Cogito
- Strong positive relationship between emotion alignment and SAE feature persistence in Cogito
- SAE feature emotion subspace overlap correlates with persistence in Cogito: Spearman +0.413, p = 4.4e-196 [finding · 0.788] Demonstrates that SAE features more aligned with the emotion subspace are more persistent in Cogito after variance control
- Core result of Experiment 3: cross-model semantic convergence under self-referential processing
- Mechanistic evidence that network actively attenuates injected perturbations, explaining late-layer introspection failure
- Probe achieves selectivity of 4.20 on pythia-410m, slightly exceeding DAS selectivity of 3.96 [finding · 0.742] Key result showing that for models larger than pythia-70m, probe selectivity matches or exceeds DAS selectivity