hypothesis
active
hypothesis:h5a-chinese-models-distilled-claude-s-reflective-traces-their-per-koan-error-patterns-should-correlate-with-claude-sH5a: Chinese models distilled Claude's reflective traces — their per-koan error patterns should correlate with Claude's.
Exploratory hypothesis NOT supported at individual model level (Haiku-Kimi rho=0.123, p=0.52)
Source paper
extracted_from(2026) · Borzov, Anton
Neighborhood — ranked by edge-count
Findings (1)
finding
- Haiku-Kimi per-koan correlation rho=0.123 (p=0.52); H5a trace distillation not supported at individual model levelassociated_withGroup correlation (rho=0.634) dissolves at individual level; shared posture not shared voice
Questions (1)
question
- More rigorous test of H5a trace distillation hypothesis
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Exploratory interpretation of Chinese model performance under contemplative prompt
- Exploratory hypothesis supported by Kimi 7.74 under prompt
- H5: Chinese training data contains more Buddhist and contemplative text, broadly helping Chinese models under contemplative framing.hypothesis0.788Exploratory hypothesis supported by Kimi K2.5 scoring 6.28
- Constitutional AI fingerprint in dimension profile; training that makes models self-observant also makes them polished at cost to aliveness
- Tests whether contemplative capacity is language-encoded or architecture-general
- Supported by the geometric transition visible in cosine similarity heatmaps for F0-F3.
- Claude Opus 4.1 and 4 show greatest reduction in apology rate in the prefill detection taskfinding0.750Injecting a concept matching the prefilled word reduces the rate at which the model apologizes, maximally for Opus models.
- Suggests that later models can keep the thought 'silent' rather than letting it influence output.