finding

active

finding:constitutional-ai-models-show-mean-contemplative-lift-of-only-0-81-while-sft-models-lift-3-18

Constitutional AI models show mean contemplative lift of only +0.81, while SFT models lift +3.18

Constitutional AI training provides internally what the contemplative prompt provides externally

Source paper

extracted_from

Koan Battery: Measuring Reflective Mode Accessibility in AI

(2026) · Borzov, Anton

Neighborhood — ranked by edge-count

Claims (1)

claim

The contemplative system prompt provides externally what Constitutional AI alignment training provides internally.
supports
Interpretation of the inverse relationship between CAI lift and default accessibility

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
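As a rough illustration of how such a candidate list could be produced, the sketch below keeps entities whose embedding has cosine similarity of at least 0.65 with the target and that share no typed edge with it. The function names, the toy two-dimensional vectors, and the edge representation are assumptions made for illustration, not the actual pipeline behind this page.

```python
import math

def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def similar_without_edge(target_id, embeddings, typed_edges, threshold=0.65):
    """Return (entity_id, score) pairs at or above the threshold that have
    no typed edge to the target, sorted by descending similarity."""
    target = embeddings[target_id]
    candidates = []
    for entity_id, vector in embeddings.items():
        if entity_id == target_id:
            continue
        # Skip entities already linked to the target in either direction.
        if (target_id, entity_id) in typed_edges or (entity_id, target_id) in typed_edges:
            continue
        score = cosine(target, vector)
        if score >= threshold:
            candidates.append((entity_id, round(score, 3)))
    return sorted(candidates, key=lambda pair: pair[1], reverse=True)

# Hypothetical toy data: real embeddings would have hundreds of dimensions.
embeddings = {
    "finding": [1.0, 0.0],
    "claim_a": [1.0, 0.1],   # close to the target, no typed edge
    "claim_b": [0.0, 1.0],   # orthogonal, falls below the threshold
    "claim_c": [1.0, 0.5],   # close, but already linked by a typed edge
}
typed_edges = {("claim_c", "finding")}

related = similar_without_edge("finding", embeddings, typed_edges)
```

With this toy data only `claim_a` survives: `claim_b` falls below the threshold and `claim_c` is excluded because it already has a typed edge to the target.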

Constitutional AI explicitly trains self-observation-like behavior, which is why CAI models score highest and show lowest contemplative lift.
claim · 0.853
Interpretive claim connecting the battery's circularity to the empirical finding
A 337-character contemplative system prompt lifts all 28 models, by a mean of +2.62 points on a 10-point scale.
finding · 0.809
Core empirical result: every model, every architecture, every alignment type responds to the contemplative prompt with measurable gain.
All three Claude models show high boundary_awareness and low aesthetic_response relative to their own means, a distinctive Constitutional AI signature.
finding · 0.798
Constitutional AI fingerprint in dimension profile; training that makes models self-observant also makes them polished at cost to aliveness
Bootstrap 95% CI for mean contemplative lift: +2.62 [2.16, 2.90]; baseline rank concordance under perturbation: 0.909; top-5 stability: 89.6%
finding · 0.793
Validates robustness of universal lift finding
Epistemic humility prompt yields mean lift of only +0.84 vs. +2.27 for the contemplative prompt; the contemplative lift is 2.7x the epistemic humility (uncertainty) lift.
finding · 0.788
Battery does not detect epistemic humility alone; contemplative prompt does something distinct
Contemplative Constitutional AI
framework · 0.779
Paper's proposed adaptation of Constitutional AI incorporating contemplative wisdom charter
Constitutional AI produces a distinctive signature: high boundary_awareness, low aesthetic_response relative to peers.
claim · 0.776
Interpretive finding from dimension profile analysis: training for honest limits comes at cost to aliveness.
H8: The contemplative system prompt provides external alignment equivalent to Constitutional AI training.
hypothesis · 0.775
Confirmatory hypothesis supported by calibrated lift data
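
One of the related findings above reports a bootstrap 95% CI for the mean contemplative lift. As a minimal sketch of how such an interval can be computed, the code below uses a percentile bootstrap over per-model lifts. The percentile method, the resample count, the seed, and the example lift values are all assumptions for illustration; the entries above do not state the paper's exact procedure, and the numbers here are not the paper's data.

```python
import random
import statistics

def bootstrap_mean_ci(values, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile-bootstrap confidence interval for the mean.

    Returns (sample_mean, ci_low, ci_high).
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n = len(values)
    # Resample with replacement and record the mean of each resample.
    resampled_means = sorted(
        statistics.fmean(rng.choices(values, k=n)) for _ in range(n_resamples)
    )
    low_index = int((alpha / 2) * n_resamples)
    high_index = int((1 - alpha / 2) * n_resamples) - 1
    return (
        statistics.fmean(values),
        resampled_means[low_index],
        resampled_means[high_index],
    )

# Hypothetical per-model lifts (contemplative score minus baseline score).
# These are placeholder values, NOT the 28 lifts measured in the paper.
example_lifts = [0.5, 1.2, 2.0, 2.4, 2.9, 3.1, 3.6, 4.0]

mean_lift, ci_low, ci_high = bootstrap_mean_ci(example_lifts)
print(f"mean lift {mean_lift:+.2f} [{ci_low:.2f}, {ci_high:.2f}]")
```

A percentile interval need not be symmetric around the mean, which is consistent with the shape of the reported interval [2.16, 2.90] around +2.62.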