finding
active
finding:qwq-32b-accuracy-on-mmlu-formal-logic-stays-between-95-5-and-96-3-across-all-intervention-strengths-while-tokens-reduced-from-1716-6-to-1481-4-at-0-96QwQ-32B accuracy on MMLU Formal Logic stays between 95.5% and 96.3% across all intervention strengths while tokens reduced from 1716.6 to 1481.4 at -0.96
Demonstrates reflection redundancy in larger models on non-mathematical reasoning
Source paper
extracted_from(2025) · Ge Yan · Sun, Chung-En · Tsui-Wei · Weng
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Demonstrates reflection redundancy in stronger model on harder math benchmark
- Demonstrates that stronger models are largely insensitive to reflection manipulation
- Shows smaller models are more sensitive to reflection reduction on non-math tasks
- Maximum token savings achieved by ReflCtrl on non-mathematical general reasoning tasks
- Out-of-domain generalization showing deception features track general representational honesty
- Likely-trained MM probe is a surprisingly effective causal baseline due to correlation between truth and probability on sp_en_trans
- Only model showing marginal benefit from increased reflection, at substantial token cost
- Core detection result showing LAT-based steering vectors can identify deceptive states with high accuracy