hypothesis
active
hypothesis:the-model-tends-to-reflect-more-when-the-question-is-difficult-and-accuracy-is-generally-lower-for-harder-questionsThe model tends to reflect more when the question is difficult, and accuracy is generally lower for harder questions
Hypothesis explaining negative correlation between reflection rate and accuracy without implying reflection is harmful
Source paper
extracted_from(2025) · Ge Yan · Sun, Chung-En · Tsui-Wei · Weng
Neighborhood — ranked by edge-count
Findings (1)
finding
- Easy questions (acc > 80%) have average reflection rate of 25.8% for DeepSeek-R1 Llama 8b on GSM8ksupportsBaseline reflection rate for easy questions confirming difficulty-reflection correlation
Claims (1)
claim
- Author's interpretation of the negative correlation between reflection rate and accuracy observed in Fig. 5
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Implication of PRH for AI fairness and bias
- Bigger models are more likely to converge to a shared representation than smaller modelshypothesis0.797Selective pressure toward convergence via model capacity
- Second central research question motivating ReflCtrl investigation
- Per-category analysis showing reflection rate does not help within difficulty class
- Earlier/less capable models exhibit a larger gap between think and don't think representation strengthfinding0.786Claude 3 models show a bigger difference than newer models like Opus 4.1.
- Motivating hypothesis for Section 5's investigation of prompt template effects.