finding
active
finding:deepseek-r1-llama-8b-gains-0-16-accuracy-on-gsm8k-with-positive-intervention-more-reflections-at-cost-of-2000-additional-tokens

DeepSeek-R1 Llama 8B gains 0.16% accuracy on GSM8K with positive intervention (more reflections), at a cost of ~2000 additional tokens

The only model to show any benefit from increased reflection; the gain is marginal and comes at a substantial token cost
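
To put the trade-off in perspective, the two figures above can be turned into a rough cost ratio. The sketch below is illustrative only and rests on assumptions not stated in this record: that the 0.16% gain is in absolute percentage points, that the ~2000 extra tokens are per response, and that evaluation used the standard 1,319-problem GSM8K test split. The function names are hypothetical.

```python
def tokens_per_point(gain_points: float, extra_tokens: float) -> float:
    """Extra tokens spent per percentage point of accuracy gained."""
    if gain_points <= 0:
        raise ValueError("gain_points must be positive")
    return extra_tokens / gain_points


def problems_flipped(gain_points: float, n_problems: int) -> float:
    """Approximate number of additional problems answered correctly."""
    return gain_points / 100 * n_problems


GAIN_POINTS = 0.16       # from the finding; assumed to be percentage points
EXTRA_TOKENS = 2000      # from the finding; assumed to be per response
GSM8K_TEST_SIZE = 1319   # assumption: standard GSM8K test split

cost = tokens_per_point(GAIN_POINTS, EXTRA_TOKENS)
flipped = problems_flipped(GAIN_POINTS, GSM8K_TEST_SIZE)

print(f"about {cost:,.0f} extra tokens per accuracy point")
print(f"about {flipped:.1f} additional problems solved")
```

Under these assumptions the gain corresponds to roughly two additional problems solved, which is why the benefit is described as marginal.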

Source paper

extracted_from
ReflCtrl: Controlling LLM Reflection via Representation Engineering
(2025) · Ge Yan · Chung-En Sun · Tsui-Wei Weng

Neighborhood — ranked by edge-count

Claims (1)

claim

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.