community
active
leiden_hybrid_concepts
label: haiku
community:leiden_hybrid_concepts-run4-c15-c4
Textual originality and training data inference
Methods for detecting novel phrases absent from web indices and likely outside LLM training corpora, using Google search null results as a proxy metric.
2 members.
Drawn from 1 source
The papers/notes whose extracted claims & findings make up this cluster.
Bridges (1)
Other communities that share members with this one — cross-cutting threads or papers that sit at the seam between two themes.
Claims (1)
- The Xeno Sutra is original insofar as many of its striking phrases are unlikely to have been in the training set. Conclusion from the Google search findings.
Findings (1)
- Most unique phrases in the Xeno Sutra (e.g., 'Thus have I heard beyond numbers and names', 'seed without center') yield zero Google search results as of 1 July 2025. Empirical originality check from Table 1, supporting the claim of originality.
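
The zero-result check described in the finding can be sketched roughly as follows. This is an illustrative outline, not the procedure the source used: `count_hits` is a hypothetical callable standing in for whatever search backend reports a result count for an exact-match (quoted) query, and `make_stub_counter` is a toy in-memory substitute so the sketch runs without network access.

```python
from typing import Callable, Iterable, List


def zero_hit_phrases(phrases: Iterable[str],
                     count_hits: Callable[[str], int]) -> List[str]:
    """Return the phrases whose exact-match (quoted) query reports zero results."""
    return [p for p in phrases if count_hits('"' + p + '"') == 0]


def zero_hit_fraction(phrases: Iterable[str],
                      count_hits: Callable[[str], int]) -> float:
    """Fraction of phrases with zero reported results (the proxy metric)."""
    phrases = list(phrases)
    if not phrases:
        raise ValueError("at least one phrase is required")
    return len(zero_hit_phrases(phrases, count_hits)) / len(phrases)


def make_stub_counter(indexed_texts: Iterable[str]) -> Callable[[str], int]:
    """Hypothetical stand-in for a search backend.

    Counts how many of the given texts contain the quoted phrase,
    ignoring case. A real check would query a web search index instead.
    """
    texts = [t.lower() for t in indexed_texts]

    def count_hits(query: str) -> int:
        phrase = query.strip('"').lower()
        return sum(phrase in text for text in texts)

    return count_hits
```

A zero count is only a proxy: it shows the phrase is absent from the queried index on the date of the query, which makes its presence in a training corpus less likely but does not rule it out.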