finding
active
finding:qwq-32b-accuracy-on-gsm8k-remains-between-96-36-and-96-50-across-all-intervention-strengths-0-96-to-0-48

QwQ-32B accuracy on GSM8k remains between 96.36% and 96.50% across all intervention strengths (-0.96 to +0.48)

Demonstrates that stronger models are largely insensitive to reflection manipulation

Source paper

extracted_from
ReflCtrl: Controlling LLM Reflection via Representation Engineering
(2025) · Ge Yan · Sun, Chung-En · Tsui-Wei · Weng

Neighborhood — ranked by edge-count

Claims (1)

claim

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.