finding
active
finding:constitutional-ai-models-show-mean-contemplative-lift-of-only-0-81-while-sft-models-lift-3-18Constitutional AI models show mean contemplative lift of only +0.81, while SFT models lift +3.18
Constitutional AI training provides internally what the contemplative prompt provides externally
Source paper
extracted_from(2026) · Borzov, Anton
Neighborhood — ranked by edge-count
Claims (1)
claim
- Interpretation of the inverse relationship between CAI lift and default accessibility
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Interpretive claim connecting the battery's circularity to the empirical finding
- A 337-character contemplative system prompt lifts all 28 models by +2.62 points on a 10-point scale.finding0.809Core empirical result: every model, every architecture, every alignment type responds to the contemplative prompt with measurable gain.
- Constitutional AI fingerprint in dimension profile; training that makes models self-observant also makes them polished at cost to aliveness
- Validates robustness of universal lift finding
- Battery does not detect epistemic humility alone; contemplative prompt does something distinct
- Paper's proposed adaptation of Constitutional AI incorporating contemplative wisdom charter
- Interpretive finding from dimension profile analysis: training for honest limits comes at cost to aliveness.
- H8: The contemplative system prompt provides external alignment equivalent to Constitutional AI training.hypothesis0.775Confirmatory hypothesis supported by calibrated lift data