finding
active
finding:qwq-32b-accuracy-on-mmlu-formal-logic-stays-between-95-5-and-96-3-across-all-intervention-strengths-while-tokens-reduced-from-1716-6-to-1481-4-at-0-96

QwQ-32B accuracy on MMLU Formal Logic stays between 95.5% and 96.3% across all intervention strengths, while tokens are reduced from 1716.6 to 1481.4 at an intervention strength of -0.96

Demonstrates reflection redundancy in larger models on non-mathematical reasoning
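
To put the quoted figures in proportion, the short sketch below works out the absolute and relative token savings and the width of the accuracy band. It uses only the numbers stated in the finding; the variable names are illustrative, and treating 1716.6 as the count at the starting (weaker) intervention setting is an assumption based on the wording "reduced from".

```python
# Figures quoted in the finding; names are illustrative, not from the paper.
tokens_before = 1716.6   # assumed: token count before the -0.96 intervention
tokens_after = 1481.4    # token count at intervention strength -0.96
acc_low, acc_high = 95.5, 96.3  # accuracy range (%) across all strengths


def relative_reduction(before: float, after: float) -> float:
    """Fraction of tokens removed when going from `before` to `after`."""
    return (before - after) / before


tokens_saved = tokens_before - tokens_after
pct_saved = 100 * relative_reduction(tokens_before, tokens_after)
acc_spread = acc_high - acc_low

print(f"tokens saved: {tokens_saved:.1f}")
print(f"relative reduction: {pct_saved:.1f}%")
print(f"accuracy spread: {acc_spread:.1f} percentage points")
```

On these figures, the intervention removes roughly 14% of the tokens while accuracy moves by less than one percentage point.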

Source paper

extracted_from
ReflCtrl: Controlling LLM Reflection via Representation Engineering
(2025) · Ge Yan · Chung-En Sun · Tsui-Wei Weng

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.