finding

active

finding:no-reflection-with-answer-achieves-accuracy-037-on-gsm8k-adv-for-qwen2-5-3b

No Reflection with 'Answer' achieves accuracy .037 on gsm8k_adv for Qwen2.5-3B

Baseline accuracy when reflection is suppressed.

Source paper

extracted_from

Unveiling the Latent Directions of Reflection in Large Language Models

(2025) · Chang, Fu-Chieh · Lee, Yu-Ting · Wu, Pei-Yuan

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Triggered Reflection with 'Alternatively' achieves accuracy .684 on gsm8k_adv for Gemma3-4B-ITfinding0.852
Highest single-instruction accuracy result in the paper.
Reflection direction features achieve AUROC 0.772 vs. 0.736 for final layer baseline on deepseek-llama-8b on GSM8k correctness predictionfinding0.819
Supports claim that uncertainty is encoded in reflection direction
QwQ-32B accuracy on GSM8k remains between 96.36% and 96.50% across all intervention strengths (-0.96 to +0.48)finding0.817
Demonstrates that stronger models are largely insensitive to reflection manipulation
Easy questions (acc > 80%) have average reflection rate of 25.8% for DeepSeek-R1 Llama 8b on GSM8kfinding0.795
Baseline reflection rate for easy questions confirming difficulty-reflection correlation
Clear accuracy stratification across three reflection levels on cruxeval_o_adv: Triggered (.065/.247) > Intrinsic (.040/.133) > No Reflection (.017/.051) for Qwen2.5-3B/Gemma3-4B-ITfinding0.777
Core empirical result validating the three-level reflection framework on code reasoning.
Top-5 instructions by µ(1→2) at ℓ=12 achieve average cosine similarity .9893 and average accuracy .5645 on gsm8k_adv for Gemma3-4B-ITfinding0.775
High cosine similarity for Gemma3 steering vectors suggests strong linear reflection structure.
DeepSeek-R1 Llama 8b gains 0.16% accuracy on GSM8k with positive intervention (more reflections) at cost of ~2000 additional tokensfinding0.773
Only model showing marginal benefit from increased reflection, at substantial token cost
Higher reflection frequency correlates with lower accuracy partly because more reflections are generated on difficult questionsclaim0.758
Author's interpretation of the negative correlation between reflection rate and accuracy observed in Fig. 5