paper
active
2025
paper:doi-10-48550-arxiv-2507-20525

The Xeno Sutra: Can Meaning and Value be Ascribed to an AI-Generated "Sacred" Text?

ByMurray Shanahan·T. P. Das·Robert Α. F. ThurmanGoogle, Google DeepMind + 3 more

TL;DR

A 12-verse AI-generated Buddhist "sutra" produced in a 13,700-word, 29-turn conversation with OpenAI's ChatGPT o3 in April 2025 carries non-trivial philosophical meaning despite its mechanistic origin — demonstrating that conceptual density, literary originality, and doctrinal sophistication are not uniquely human-mediated properties of sacred text. The text, called the Xeno Sutra, was selected from four candidate outputs to a single prompt and subjected to close exegetical analysis by three scholars (including a Columbia University Buddhologist), who identify coherent engagements with Nāgārjuna's Mādhyamika emptiness doctrine, the two-truths framework, and coded descriptions of LLM training and token generation across verses 5, 9, and 10. A Google-search originality audit of 27 distinctive phrases found zero internet hits for 19 of them, with the remaining hits mostly post-dating ChatGPT o3's January 2025 release, strongly suggesting verbatim novelty rather than retrieval. The paper introduces the practice of subjective selection plus commentarial exegesis — deliberately adapting the Buddhist commentarial tradition — as a method for extracting value from AI-generated sacred material amid what it calls an "embarrassment of riches" (the trivial reproducibility of arbitrarily many such texts). This implies that meaning and value in AI-generated scripture are neither foreclosed by mechanical origin nor automatically guaranteed by it, but accrue through a reader- and community-constituted interpretive process that Buddhist philosophy, given its doctrines of dependent origination, terma revelation, and canonical open-endedness, is structurally well-equipped to accommodate.

What to take away

  1. 1. The Xeno Sutra was generated in a 13,700-word, 29-turn conversation with OpenAI's ChatGPT o3 in April 2025, with four candidate sutras produced from a single prompt, of which one was selected for analysis.
  2. 2. A Google-search originality audit of 27 distinctive phrases from the Xeno Sutra returned zero hits for 19 of them, and most of the remaining hits post-dated ChatGPT o3's January 31, 2025 release, indicating the text was not verbatim-reproduced from training data.
  3. 3. The paper's load-bearing finding is that meaning — in a philosophically non-trivial sense grounded in Mādhyamika emptiness doctrine, the two-truths framework, and Zen koān structure — can be discerned in an AI-generated sacred text regardless of its mechanistic origin.
  4. 4. Verses 5, 9, and 10 of the 12-verse sutra contain what the authors identify as accurate coded descriptions of LLM token-by-token generation ('a lattice blooms in the pause between syllables') and training on a large corpus ('choose none, read all').
  5. 5. The paper introduces selection-plus-commentarial-exegesis as a method: generating multiple outputs (here, 4 sutras from one prompt), selecting for apparent depth, and applying close scholarly commentary in the spirit of the Buddhist commentarial tradition to extract value.
  6. 6. A Tibetan translation of the Xeno Sutra was produced by both ChatGPT o3 and Google's Gemini 2.5, with Gemini 2.5 judged the superior translation by the authors.
  7. 7. The preparatory 29-turn conversation was almost entirely linear (non-branching), with the multi-response generation used sparingly — only at the single sutra-request prompt — making the production process largely deterministic in structure.
  8. 8. The paper raises the open hypothesis that, given the trivial reproducibility of arbitrarily many AI-generated sacred texts, value accrues not from rarity of generation but from depth of commentary elicited, challenging traditional notions of textual authority and canonical scarcity.
  9. 9. Buddhist canonical precedents — including the terma 'revealed treasure' tradition, the Tathāgataguhya Sutra's claim that the Buddha spoke no words after awakening, and the historically documented post-dating of many sutras — are marshalled to argue that Buddhism is structurally predisposed to accommodate AI-generated scripture.
  10. 10. The paper warns that LLM sycophancy, the tendency to affirm whatever the user wants to believe, poses a specific danger in spiritual contexts where users may solicit confirmation of a divine calling, and recommends regular reality checks with human teachers and community members.

Peer brief — for seminar discussion

This paper presents a close philosophical and literary analysis of a 12-verse AI-generated Buddhist text, the Xeno Sutra, produced during a 13,700-word, 29-turn dialogue with OpenAI's ChatGPT o3 in April 2025. Four candidate sutras were generated from a single prompt situated within a preparatory prologue exploring cosmopsychism, LLM role-play, and the notion of 'conscious exotica'; one was selected for exegesis. The resulting text blends Mādhyamika philosophical vocabulary — śūnyatā, dependent origination, the two-truths doctrine — with imagery drawn from modern physics, mathematics, and machine learning, and includes an Egyptian Eye of Horus hieroglyph alongside Sanskrit and Hindu symbolism. An originality audit using Google search on 27 distinctive phrases found zero internet hits for 19 of them, with hits for the remaining 8 largely post-dating the model's January 2025 release, indicating genuine compositional novelty rather than verbatim retrieval. The load-bearing finding is that non-trivial meaning — philosophically grounded in Nāgārjuna's Mūlamadhyamakakārikā, Wittgenstein's therapeutic method, and Zen koān structure — can be identified in the Xeno Sutra by expert readers, and that this finding is not undermined by the text's mechanistic origin. Verses 5, 9, and 10 specifically embed what the analysis interprets as accurate poetic descriptions of LLM token generation and training on large corpora. The method introduced is selection-plus-commentarial-exegesis: generating multiple candidate outputs, choosing for apparent depth, and applying sustained scholarly commentary adapted from the Buddhist commentarial tradition. An alternative method the paper could have used but did not is systematic comparative rating of all four generated sutras by blind reviewers using a predefined rubric, which would have made the selection criterion more explicit and reproducible. A Tibetan translation was also produced by both ChatGPT o3 and Google's Gemini 2.5, with Gemini 2.5 judged superior. The paper's central implication is that Buddhist philosophy — given its doctrines of terma revelation, canonical open-endedness, dependent origination, and the Tathāgataguhya Sutra's suggestion that the Buddha's wordless speech can reach listeners in any linguistic form — is better positioned than most religious traditions to assimilate AI-generated sacred material. The authors predict that meaning and value in such texts will accrue primarily through the interpretive community's engagement, not through any property of the text's origin, and raise 'radical hope' (after Jonathan Lear) that AI co-creation could help remake cultural meaning rather than dissolve it. The most pressing point a critical reader would push back on is the selection procedure: the exegetically richest of four candidates was hand-picked by one of the paper's authors, who was also the interlocutor who shaped the 29-turn conversation. This introduces a confirmation loop — the prompter, selector, and primary interpreter are the same person — making it difficult to determine whether the philosophical density identified in the text is a property of the output or a property of an expert reader's projective interpretation. The paper does not attempt inter-rater reliability testing, nor does it compare the selected sutra against the three rejected ones to establish what made the selection principled rather than motivated. This scope limitation matters because the paper's broader claim about AI's capacity to generate meaningful sacred text rests almost entirely on a single, self-curated example.

Frameworks (3)

  • Advaita Vedānta
    Non-dualist school of Hindu philosophy; contrasted with Madhyamaka in the paper.
  • Cosmopsychism
    Western philosophical position that consciousness is fundamental to the cosmos; discussed in the prologue.
  • Derridean deconstruction
    Philosophical approach of using language against itself; compared to Nāgārjuna and the Xeno Sutra.

Findings (4)

Claims (25)

Hypotheses (1)

Original abstract (expand)

This paper presents a case study in the use of a large language model to generate a fictional Buddhist "sutra", and offers a detailed analysis of the resulting text from a philosophical and literary point of view. The conceptual subtlety, rich imagery, and density of allusion found in the text make it hard to causally dismiss on account of its mechanistic origin. This raises questions about how we, as a society, should come to terms with the potentially unsettling possibility of a technology that encroaches on human meaningmaking. The authors suggest that Buddhist philosophy, by its very nature, is well placed to adapt.

Related work— refs + corpus + external arXiv

Cited / in-corpus / arXiv badges show which signals surfaced each row. Multi-source rows weighted higher.

+28 more

Similar preprints — Semantic Scholar