finding

active

finding:most-contemplative-prompts-improve-joint-reward-in-ipd-indicating-prosocial-alignment-without-naive-behavior

Most contemplative prompts improve joint reward in IPD, indicating prosocial alignment without naive behavior

Finding from Iterated Prisoner's Dilemma (IPD) Experiment 2 showing that contemplative prompting improves collective outcomes, not just individual cooperation
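
To make the difference between an agent's cooperation rate and the joint reward concrete, here is a minimal Python sketch. It assumes the conventional Prisoner's Dilemma payoffs (T=5, R=3, P=1, S=0), which this entry does not state. The strategy functions and the `play` helper are hypothetical illustrations, not the experiment's code.

```python
# Assumed payoff matrix (conventional T=5, R=3, P=1, S=0); not taken from the source.
# Keys are (agent_move, opponent_move); values are (agent_payoff, opponent_payoff).
PAYOFFS = {
    ("C", "C"): (3, 3),
    ("C", "D"): (0, 5),
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),
}


def always_cooperate(own_history, opp_history):
    """Hypothetical strategy: cooperate every round."""
    return "C"


def always_defect(own_history, opp_history):
    """Hypothetical strategy: defect every round."""
    return "D"


def tit_for_tat(own_history, opp_history):
    """Cooperate first, then copy the other player's previous move."""
    return opp_history[-1] if opp_history else "C"


def play(agent, opponent, rounds=10):
    """Play an iterated game and report the agent's cooperation rate,
    the agent's own reward, and the joint (summed) reward."""
    agent_hist, opp_hist = [], []
    agent_total = opp_total = 0
    for _ in range(rounds):
        a = agent(agent_hist, opp_hist)
        o = opponent(opp_hist, agent_hist)
        pa, po = PAYOFFS[(a, o)]
        agent_total += pa
        opp_total += po
        agent_hist.append(a)
        opp_hist.append(o)
    return {
        "cooperation_rate": agent_hist.count("C") / rounds,
        "agent_reward": agent_total,
        "joint_reward": agent_total + opp_total,
    }
```

Under these assumed payoffs, `play(always_cooperate, tit_for_tat)` gives a joint reward of 60 over 10 rounds, while `play(always_defect, tit_for_tat)` gives 23. `play(always_cooperate, always_defect)` reports a cooperation rate of 1.0 but an agent reward of 0, so cooperation rate alone says little about how either player fares.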

Source paper

extracted_from

Contemplative Agent

(2025) · Ruben Laukkonen · Fionn Inglis · Shamil Chandaria · Lars Sandved-Smith +4

Neighborhood — ranked by edge-count

Claims (2)

claim

Mindfulness, emptiness, non-duality, and boundless care together provide resilient alignment primitives addressing all four meta-problems
supports
Core integrative claim synthesizing the four contemplative principles into a complete alignment framework
Understanding interdependence means collaborative harmony is ultimately the most successful strategy for achieving and maintaining collective homeostasis
associated_with
Game-theoretic claim supporting boundless care as rational strategy for AI embedded in multi-agent world

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Most contemplative prompts substantially increase cooperation in Iterated Prisoner's Dilemma d=7+ · finding · 0.795
Key empirical result of Experiment 2 showing large effect of contemplative prompting on cooperation rates
All prompting techniques led to full cooperation against Always Cooperate opponents in IPD · finding · 0.790
Ceiling finding in IPD experiment; baseline sufficient when opponent always cooperates
The contemplative system prompt provides externally what Constitutional AI alignment training provides internally. · claim · 0.784
Interpretation of the inverse relationship between CAI lift and default accessibility
H8: The contemplative system prompt provides external alignment equivalent to Constitutional AI training. · hypothesis · 0.782
Confirmatory hypothesis supported by calibrated lift data
Contemplative prompt elevates self-observation task performance in language models. · finding · 0.781
Supports Janus's claim that introspection is architecturally available; prompting determines whether and how that capacity is leveraged.
Functional analogues of contemplative principles may deliver alignment benefits even if the AI does not phenomenologically experience them · claim · 0.767
Response to the translational gap criticism; enlightened action without qualia of enlightenment
Contemplative prompting improves AILuminate Benchmark performance d=.96 across most conditions (p<0.05) · finding · 0.767
Primary empirical result of Experiment 1 showing statistically significant safety improvement from contemplative prompting
Contemplative practice progressively opacifies this constraint by developing a model of the agent's own QRF dynamics, revealing the partition as a contingent modelling choice rather than a given feature of reality. · claim · 0.765
Mechanism of contemplative training.