thinker:shamil-chandariaShamil Chandaria
Authored papers (1)
Embedding four Buddhist-derived axiomatic principles—mindfulness, emptiness, non-duality, and boundless care—into AI systems via a framework the paper terms the 'Wise World Model' produces measurable alignment gains and cooperation boosts in current transformer-based LLMs. Pilot experiments on GPT-4o and GPT-4.1 nano using structured contemplative prompts yielded statistically significant safety improvements across ten hazard categories on the AILuminate Benchmark (d=.96 against baseline standard prompting), and drove cooperation rates and joint reward substantially upward in an Iterated Prisoner's Dilemma across 50 simulated 10-round games (d=7+), with boundless-care and non-duality prompts producing the largest effects even against always-defecting opponents. Three implementation pathways are introduced—Contemplative Architecture (full-stack active inference embedding), Contemplative Constitutional AI (CCAI, extending Anthropic's Constitutional AI framework with a 'wisdom charter'), and Contemplative Reinforcement Learning (CRL) on chain-of-thought—each targeting different integration depths from generative-model parameters to inference-time classifiers. The paper argues that because these principles restructure how goals, beliefs, and self-other boundaries are encoded rather than prescribing what specific values to hold, they provide scale-resilient intrinsic alignment that does not degrade as AI capability outstrips human oversight—contrasting with extrinsic methods like RLHF or rule-based constraints that become gameable at superintelligent scales.
More papers — OpenAlex / S2
Co-authors (8)
- Adam Elwood5 shared
- Edmundo Lopez-Sola5 shared
- Fionn Inglis5 shared
- Jakob Hohwy5 shared
- Jonathan Gold5 shared
- Lars Sandved-Smith5 shared
- Ruben Laukkonen3 shared
- Ruben Laukkonen2 shared
Recent mentions (1)
- papers-typedlaukkonen-2025-contemplative-agent.md