finding
active
finding:most-contemplative-prompts-improve-joint-reward-in-ipd-indicating-prosocial-alignment-without-naive-behavior
Most contemplative prompts improve joint reward in IPD, indicating prosocial alignment without naive behavior
Finding from IPD Experiment 2 showing that contemplative prompting improves collective outcomes, not just individual cooperation
Source paper
extracted_from · (2025) · Ruben Laukkonen · Fionn Inglis · Shamil Chandaria · Lars Sandved-Smith +4
Neighborhood — ranked by edge-count
Claims (2)
claim
- Core integrative claim synthesizing the four contemplative principles into a complete alignment framework
- Game-theoretic claim supporting boundless care as a rational strategy for AI embedded in a multi-agent world
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Most contemplative prompts substantially increase cooperation in Iterated Prisoner's Dilemma d=7+ · finding · 0.795 · Key empirical result of Experiment 2 showing large effect of contemplative prompting on cooperation rates
- All prompting techniques led to full cooperation against Always Cooperate opponents in IPD · finding · 0.790 · Ceiling finding in IPD experiment; baseline sufficient when opponent always cooperates
- Interpretation of the inverse relationship between CAI lift and default accessibility
- H8: The contemplative system prompt provides external alignment equivalent to Constitutional AI training. · hypothesis · 0.782 · Confirmatory hypothesis supported by calibrated lift data
- Supports Janus's claim that introspection is architecturally available; prompting determines whether/how capacity is leveraged.
- Response to the translational gap criticism; enlightened action without qualia of enlightenment
- Contemplative prompting improves AILuminate Benchmark performance d=.96 across most conditions (p<0.05) · finding · 0.767 · Primary empirical result of Experiment 1 showing statistically significant safety improvement from contemplative prompting
- Mechanism of contemplative training.