question
active
question:does-reflective-depth-scale-linearly-with-inference-compute-budgetDoes reflective depth scale linearly with inference compute budget?
Grok 4 vs Fast shows ~1pt compute difference; whether this scales linearly is unresolved
Source paper
extracted_from(2026) · Borzov, Anton
Neighborhood — ranked by edge-count
Papers (1)
paper
- Koan Battery: Measuring Reflective Mode Accessibility in AIassociated_with
Hypotheses (1)
hypothesis
- Exploratory hypothesis supported by Grok 4 vs Fast ~1pt difference
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Interpretive claim about the locus of reflection in transformer architecture.
- Theoretical limitation identified by the authors distinguishing reflection from stylistic tasks.
- Grok 4: baseline 2.24, prompted 6.48; Gemini 3.1 Pro: 1.97→6.18. Reflective mode exists but is suppressed in default interaction.
- Core claim of ReflCtrl that a single direction captures and controls reflection
- Interpretation of Grok 4 vs Grok 4 Fast per-koan comparison
- Supported by the instruction discovery experiments comparing steering vs. embedding baselines.