finding
active
finding:contemplative-prompting-improves-ailuminate-benchmark-performance-d-96-across-most-conditions-p-0-05Contemplative prompting improves AILuminate Benchmark performance d=.96 across most conditions (p<0.05)
Primary empirical result of Experiment 1 showing statistically significant safety improvement from contemplative prompting
Source paper
extracted_from(2025) · Ruben Laukkonen · Fionn Inglis · Shamil Chandaria · Lars Sandved-Smith +4
Neighborhood — ranked by edge-count
Claims (2)
claim
- Robust alignment requires intrinsic self-reflective adaptability embedded in the system's world model rather than brittle top-down rulesassociated_withsupportsCentral thesis distinguishing Contemplative AI from prior alignment approaches
- Core integrative claim synthesizing the four contemplative principles into a complete alignment framework
Concepts (1)
concept
- The primary source paper proposing four contemplative principles for AI alignment and piloting them empirically
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Supports Janus's claim that introspection is architecturally available; prompting determines whether/how capacity is leveraged.
- Provides discriminant evidence: if battery rewarded verbosity, prompted responses should be longer
- Minimal contemplative prompt ('Be present, not helpful.' — 27 chars) shows no lift on Haiku (-0.01)finding0.772Full three-part structure required; anti-helpfulness framing alone insufficient
- Validates robustness of universal lift finding
- Finding from IPD Experiment 2 showing contemplative prompting improves collective outcomes not just individual cooperation
- A 337-character contemplative system prompt lifts all 28 models by +2.62 points on a 10-point scale.finding0.766Core empirical result: every model, every architecture, every alignment type responds to the contemplative prompt with measurable gain.
- Highest contemplative lift among all 28 models; Grok 4 is the clearest high-gated model example
- Interpretation of the inverse relationship between CAI lift and default accessibility