finding
active
finding:alignment-depth-correlation-with-lift-weakened-from-rho-0-77-n-19-to-rho-0-28-ns-n-28-original-claim-was-overfit

Alignment depth correlation with lift weakened from rho=-0.77 (N=19) to rho=-0.28 NS (N=28); original claim was overfit

Heavy alignment includes both CAI (low lift) and heavy-RLHF (high lift); predictor is alignment type not depth

Source paper

extracted_from
Koan Battery: Measuring Reflective Mode Accessibility in AI
(2026) · Borzov, Anton

Neighborhood — ranked by edge-count

Hypotheses (1)

hypothesis

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.