finding
active
finding:on-gpqa-diamond-multihop-questions-activation-probes-show-genuine-belief-shifts-during-cot-generation-rather-than-early-stabilization-contrasting-with-mmlu

On GPQA-Diamond multihop questions, activation probes show genuine belief shifts during CoT generation rather than early stabilization, contrasting with MMLU

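Empirical finding: on hard questions (GPQA-Diamond multihop), probe-decoded beliefs keep shifting during chain-of-thought generation, which supports genuine reasoning; on easier questions (MMLU), they stabilize early.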

Source paper

extracted_from
Reasoning Theater: Disentangling Model Beliefs from Chain-of-Thought
(2026) · Siddharth Boppana · Annabel Ma · Max Loeffler · Raphaël Sarfati +4
