finding
active
finding:cogito-emotion-probe-residual-autocorrelation-0-077-above-variance-matched-controls-p-1-5e-27-157-171-probes-positiveCogito emotion probe residual autocorrelation +0.077 above variance-matched controls (p=1.5e-27, 157/171 probes positive)
Demonstrates that Cogito emotion probes are persistently active beyond what is explained by their variance alone
Source paper
extracted_fromScott Sauers · Imago · Janus · Antra Tessera
Neighborhood — ranked by edge-count
Claims (1)
claim
- Central interpretive claim of the paper supported by multiple convergent analyses
Hypotheses (1)
hypothesis
- Proposed explanation for why emotion probes are more persistent than variance-matched random probes
Methods (1)
method
- Variance-Matched Random Probe ComparisonintroducesControls for variance by sampling random directions from top-k PC spaces matching each emotion probe's explained variance, and subtracting median persistence of 20 matched directions
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Emotion probe persistence correlation of 0.214 in Cogito v2.1 vs 0.099 for random vectorsfinding0.884Quantifies emotion feature persistence above random baseline in Cogito across 240 multi-turn conversations
- Demonstrates emotion-specific persistence beyond variance effects in Cogito
- Quantitative measure of emotion feature persistence vs random baseline in Cogito
- Strong positive relationship between emotion alignment and SAE feature persistence in Cogito
- Supports that persistence is genuinely tied to emotion structure rather than measurement artifact
- Core empirical claim distinguishing emotion persistence from generic high-variance probe persistence
- SAE feature emotion subspace overlap correlates with persistence in Cogito: Spearman +0.413, p=4.4e-196finding0.789Demonstrates that SAE features more aligned with the emotion subspace are more persistent in Cogito after variance control
- Validates that agentic self-evaluation captures genuine emotional content of probes