method
active
method:benjamini-hochberg-fdr-correction
Benjamini-Hochberg FDR correction
Multiple testing correction applied to significance tests of emotion persistence and self-evaluation word associations
Neighborhood — ranked by edge-count
Findings (1)
finding
- Shows immediate causal effect of steering on emotion feature activation
Methods (3)
method
- Benjamini-Hochberg (BH) correction (related_to): Applied within concept/endpoint families to control the false discovery rate across parallel tests
- Per-(emotion, token) z-score computed as injected emotion activation minus mean of 170 other probes, contrasted against no-steering baseline
- Tests whether SAE features whose self-evaluation transcripts mention a specific emotion word have higher cosine similarity to that emotion probe
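The BH correction named in the entries above can be sketched as follows. This is a minimal illustration of the standard step-up procedure, not the code used in the underlying work. The function names `benjamini_hochberg` and `bh_within_families`, the `(family, test_id, p_value)` tuple layout, and the default `alpha=0.05` are all assumptions made for the example.

```python
from collections import defaultdict


def benjamini_hochberg(p_values, alpha=0.05):
    """Standard BH step-up procedure.

    Returns (rejected, adjusted), both in the same order as p_values.
    alpha=0.05 is an illustrative default, not a value taken from the source.
    """
    m = len(p_values)
    if m == 0:
        return [], []
    # Indices of the p-values sorted in ascending order.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down to the smallest so that the
    # adjusted values are monotone non-decreasing in rank.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    rejected = [q <= alpha for q in adjusted]
    return rejected, adjusted


def bh_within_families(tests, alpha=0.05):
    """Apply BH separately inside each family of parallel tests.

    tests: iterable of (family, test_id, p_value) tuples (hypothetical layout).
    Returns {test_id: (adjusted_p, rejected)}.
    """
    families = defaultdict(list)
    for family, test_id, p in tests:
        families[family].append((test_id, p))
    results = {}
    for members in families.values():
        rejected, adjusted = benjamini_hochberg([p for _, p in members], alpha)
        for (test_id, _), q, r in zip(members, adjusted, rejected):
            results[test_id] = (q, r)
    return results
```

Correcting within families means the number of tests `m` in the adjustment is the family size rather than the total number of tests run, so how tests are grouped directly affects which results survive correction.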
Related by similarity (4)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; these are candidates for new edges or unrecognized duplicates.
- Multiple comparisons correction applied to Wilcoxon p-values for the Strange Stories task with three score categories.
- The adaptive, incremental nature of living process, allowing small steps with continuous evaluation and adjustment.
- Fine-tuning on Claude-generated self-correction examples with loss masking to induce ESR-like behavior
- UCCT's theoretical prediction about how fine-tuning maps onto the anchoring score