finding
active
finding:up-to-33-6-reasoning-tokens-saved-on-mmlu-subsets-with-stepwise-steering-while-maintaining-accuracy-in-larger-modelsUp to 33.6% reasoning tokens saved on MMLU subsets with stepwise steering while maintaining accuracy in larger models
Maximum token savings achieved by ReflCtrl on non-mathematical general reasoning tasks
Source paper
extracted_from(2025) · Ge Yan · Sun, Chung-En · Tsui-Wei · Weng
Neighborhood — ranked by edge-count
Claims (1)
claim
- Key interpretive finding that stronger models can have reflections reduced with minimal accuracy cost
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Demonstrates reflection redundancy in larger models on non-mathematical reasoning
- Empirical observation motivating the need to control reflection for inference efficiency
- Empirical measurement motivating inference cost reduction via ReflCtrl
- Comparative claim between the two steering strategies
- Key result demonstrating advantage of stepwise over all-token steering strategy
- Shows that causal steering effects persist over long ranges for a substantial fraction of emotion probes
- We hypothesize that degraded generalization on benchmarks like MMLU may reflect the computational demands of the tasks.hypothesis0.750Connecting the paper's task-difficulty findings to prior observations of weak generalization on complex QA benchmarks.
- Initial evidence that alignment faking persona is more sensitive to exploiting training signals