Shamil Chandaria

openalex A5085626035 name_hash 060313a2f36ec187f26d96a1…

Authored papers (1)

Contemplative Agent (2025) ⓒ 1
The paper proposes embedding four Buddhist-derived axiomatic principles (mindfulness, emptiness, non-duality, and boundless care) into AI systems through a framework it calls the 'Wise World Model', and reports measurable alignment and cooperation gains in current transformer-based LLMs. In pilot experiments on GPT-4o and GPT-4.1 nano, structured contemplative prompts yielded statistically significant safety improvements across ten hazard categories on the AILuminate Benchmark (d = 0.96 relative to baseline standard prompting). In an Iterated Prisoner's Dilemma run over 50 simulated 10-round games, contemplative prompts substantially raised cooperation rates and joint reward (d = 7+); boundless-care and non-duality prompts produced the largest effects, even against always-defecting opponents. Three implementation pathways are introduced: Contemplative Architecture (full-stack active inference embedding), Contemplative Constitutional AI (CCAI, which extends Anthropic's Constitutional AI framework with a 'wisdom charter'), and Contemplative Reinforcement Learning (CRL) on chain-of-thought. Each targets a different integration depth, from generative-model parameters to inference-time classifiers. The paper argues that these principles restructure how goals, beliefs, and self-other boundaries are encoded rather than prescribing which specific values to hold, and that they therefore provide scale-resilient intrinsic alignment that does not degrade as AI capability outstrips human oversight. It contrasts this with extrinsic methods such as RLHF or rule-based constraints, which it argues become gameable at superintelligent scales.
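
To make the reported quantities concrete, the following is a minimal Python sketch of how a cooperation rate, a joint reward, and a standardized effect size d could be computed for Iterated Prisoner's Dilemma games. It is not the paper's code. The payoff values (3/3, 0/5, 5/0, 1/1), the pooled-standard-deviation form of d, and the function names `play_game` and `cohens_d` are all assumptions chosen for illustration; the summary above does not specify them.

```python
from statistics import mean, stdev

# ASSUMPTION: textbook Prisoner's Dilemma payoffs (agent, opponent).
# The paper's actual payoff matrix is not given in this summary.
PAYOFFS = {
    ("C", "C"): (3, 3),
    ("C", "D"): (0, 5),
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),
}


def play_game(agent_moves, opponent_moves):
    """Score one game; return (agent cooperation rate, joint reward)."""
    joint_reward = 0
    for a, b in zip(agent_moves, opponent_moves):
        reward_a, reward_b = PAYOFFS[(a, b)]
        joint_reward += reward_a + reward_b
    cooperation_rate = agent_moves.count("C") / len(agent_moves)
    return cooperation_rate, joint_reward


def cohens_d(treatment, control):
    """Standardized mean difference using a pooled standard deviation.

    ASSUMPTION: this is the common Cohen's d formula; the summary only
    reports values labeled "d" without naming the estimator.
    """
    n1, n2 = len(treatment), len(control)
    pooled_var = (
        (n1 - 1) * stdev(treatment) ** 2 + (n2 - 1) * stdev(control) ** 2
    ) / (n1 + n2 - 2)
    return (mean(treatment) - mean(control)) / pooled_var ** 0.5
```

For example, `play_game(["C"] * 10, ["D"] * 10)` scores a 10-round game in which the agent always cooperates against an always-defecting opponent. Collecting the per-game cooperation rates for a prompted condition and a baseline condition and passing the two lists to `cohens_d` would give an effect size of the kind quoted above, under the stated assumptions.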

Co-authors (8)

Recent mentions (1)

papers-typed
laukkonen-2025-contemplative-agent.md