finding
active
finding:qwq-32b-accuracy-on-gsm8k-remains-between-96-36-and-96-50-across-all-intervention-strengths-0-96-to-0-48QwQ-32B accuracy on GSM8k remains between 96.36% and 96.50% across all intervention strengths (-0.96 to +0.48)
Demonstrates that stronger models are largely insensitive to reflection manipulation
Source paper
extracted_from(2025) · Ge Yan · Sun, Chung-En · Tsui-Wei · Weng
Neighborhood — ranked by edge-count
Claims (1)
claim
- Key interpretive finding that stronger models can have reflections reduced with minimal accuracy cost
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Demonstrates reflection redundancy in larger models on non-mathematical reasoning
- Baseline accuracy when reflection is suppressed.
- Demonstrates reflection redundancy in stronger model on harder math benchmark
- Only model showing marginal benefit from increased reflection, at substantial token cost
- High cosine similarity for Gemma3 steering vectors suggests strong linear reflection structure.
- Triggered Reflection with 'Alternatively' achieves accuracy .684 on gsm8k_adv for Gemma3-4B-ITfinding0.775Highest single-instruction accuracy result in the paper.
- Layer-wise analysis revealing which network depths best encode strategic deception semantics
- Quantifies harness activation failure for weak-tier models vs. strong-tier models