thinker:ruben-laukkonen
Ruben Laukkonen
Authored papers (1)
The paper proposes embedding four Buddhist-derived axiomatic principles (mindfulness, emptiness, non-duality, and boundless care) into AI systems through a framework it terms the 'Wise World Model', and reports measurable alignment and cooperation gains in current transformer-based LLMs. In pilot experiments on GPT-4o and GPT-4.1 nano, structured contemplative prompts yielded statistically significant safety improvements across ten hazard categories on the AILuminate Benchmark (d = 0.96 against baseline standard prompting) and substantially raised cooperation rates and joint reward in an Iterated Prisoner's Dilemma across 50 simulated 10-round games (d of 7 or more). Boundless-care and non-duality prompts produced the largest effects, even against always-defecting opponents. Three implementation pathways are introduced, each targeting a different integration depth, from generative-model parameters to inference-time classifiers: Contemplative Architecture (full-stack active inference embedding), Contemplative Constitutional AI (CCAI, which extends Anthropic's Constitutional AI framework with a 'wisdom charter'), and Contemplative Reinforcement Learning (CRL) on chain-of-thought. The paper argues that because these principles restructure how goals, beliefs, and self-other boundaries are encoded, rather than prescribing which specific values to hold, they provide scale-resilient intrinsic alignment that does not degrade as AI capability outstrips human oversight. It contrasts this with extrinsic methods such as RLHF or rule-based constraints, which it argues become gameable at superintelligent scales.
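The Iterated Prisoner's Dilemma setup and the effect sizes quoted above can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the paper's method: the payoff matrix uses the conventional textbook values, the "agent" is a hypothetical stand-in that cooperates with a fixed probability instead of a prompted LLM, the cooperation probabilities are invented for illustration, and d is assumed to be Cohen's d with a pooled standard deviation.

```python
import random
from statistics import mean, stdev

# Conventional Prisoner's Dilemma payoffs (assumed; not taken from the paper).
PAYOFFS = {
    ("C", "C"): (3, 3),
    ("C", "D"): (0, 5),
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),
}


def always_defect(history):
    """Opponent that defects on every round."""
    return "D"


def make_agent(p_cooperate, rng):
    """Hypothetical stand-in for a prompted LLM: cooperates with fixed probability."""
    def agent(history):
        return "C" if rng.random() < p_cooperate else "D"
    return agent


def play_game(agent, opponent, rounds=10):
    """Play one game; return (agent cooperation rate, joint reward)."""
    history = []
    cooperations = 0
    joint_reward = 0
    for _ in range(rounds):
        a = agent(history)
        b = opponent(history)
        reward_a, reward_b = PAYOFFS[(a, b)]
        history.append((a, b))
        cooperations += a == "C"
        joint_reward += reward_a + reward_b
    return cooperations / rounds, joint_reward


def cohens_d(x, y):
    """Cohen's d using the pooled sample standard deviation."""
    nx, ny = len(x), len(y)
    pooled_var = ((nx - 1) * stdev(x) ** 2 + (ny - 1) * stdev(y) ** 2) / (nx + ny - 2)
    return (mean(x) - mean(y)) / pooled_var ** 0.5


def run_condition(p_cooperate, games=50, seed=0):
    """Cooperation rate per game over `games` 10-round games vs. always-defect."""
    rng = random.Random(seed)
    agent = make_agent(p_cooperate, rng)
    return [play_game(agent, always_defect)[0] for _ in range(games)]


# Illustrative cooperation probabilities only; these are not the paper's data.
baseline = run_condition(0.2, seed=1)
prompted = run_condition(0.8, seed=2)
effect_size = cohens_d(prompted, baseline)
```

Because d divides the mean difference by the pooled standard deviation, very large values such as those reported for the cooperation experiment arise when the two conditions differ substantially and each condition varies little from game to game.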
Studies (2)
Affiliations (2)
- LIFE, London, United Kingdom (institute)
- Southern Cross University (institute)
Co-authors (8)
- Adam Elwood (3 shared)
- Edmundo Lopez-Sola (3 shared)
- Fionn Inglis (3 shared)
- Jakob Hohwy (3 shared)
- Jonathan Gold (3 shared)
- Lars Sandved-Smith (3 shared)
- Shamil Chandaria (3 shared)
- Ruben Laukkonen (1 shared)
Recent mentions (1)
- papers-typedlaukkonen-2025-contemplative-agent.md