question
active
If Chinese models distilled Claude's reflective patterns, do their per-koan failure patterns correlate with Claude's, not just successes?
A more rigorous test of the H5a trace-distillation hypothesis
Source paper
extracted_from · Borzov, Anton (2026)
Neighborhood — ranked by edge-count
Papers (1)
paper
- Koan Battery: Measuring Reflective Mode Accessibility in AI (associated_with)
Hypotheses (1)
hypothesis
- Exploratory hypothesis NOT supported at individual model level (Haiku-Kimi rho=0.123, p=0.52)
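
A minimal sketch of how the per-koan comparison in the question could be run, assuming each model has one numeric score per koan and that the reported rho is a Spearman rank correlation (the entry does not say which statistic was used). The function names, the 0.5 failure threshold, and the score values are hypothetical illustrations, not data from the paper. The idea is to compute the rank correlation only over koans where at least one model fails, so that agreement on easy successes cannot inflate the result.

```python
import math
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the full run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation; NaN when either input has zero variance."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return math.nan
    return cov / math.sqrt(vx * vy)


def spearman_rho(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman rho computed as the Pearson correlation of average ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def failure_subset_rho(a: Sequence[float], b: Sequence[float],
                       threshold: float = 0.5) -> float:
    """Spearman rho restricted to koans where at least one model scores
    below the (hypothetical) failure threshold."""
    pairs = [(x, y) for x, y in zip(a, b) if x < threshold or y < threshold]
    if len(pairs) < 3:
        return math.nan  # too few failures for a meaningful correlation
    xs, ys = zip(*pairs)
    return spearman_rho(xs, ys)


# Hypothetical per-koan scores for two models (illustration only).
model_a = [0.9, 0.2, 0.3, 0.8, 0.1]
model_b = [0.95, 0.1, 0.4, 0.9, 0.2]
rho_failures = failure_subset_rho(model_a, model_b)
```

Restricting to the failure subset shrinks the sample, so any rho obtained this way would need a significance test (for example a permutation test) before it could count as support for or against the distillation hypothesis.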
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; candidates for new edges or unrecognized duplicates.
- Exploratory interpretation of Chinese model performance under contemplative prompt
- Tests whether contemplative capacity is language-encoded or architecture-general
- Constitutional AI fingerprint in dimension profile; training that makes models self-observant also makes them polished at cost to aliveness
- Key interpretive finding that stronger models can have reflections reduced with minimal accuracy cost
- Supported by the geometric transition visible in cosine similarity heatmaps for F0-F3.
- Promising future research direction about the internal mechanism of error detection.
- Per-category analysis showing reflection rate does not help within difficulty class
- The model tends to reflect more when the question is difficult, and accuracy is generally lower for harder questions (hypothesis, cosine 0.751): Hypothesis explaining negative correlation between reflection rate and accuracy without implying reflection is harmful