question
active
question:under-what-conditions-does-chain-of-thought-reflect-genuine-uncertainty-resolution-versus-a-learned-performance

under what conditions does chain-of-thought reflect genuine uncertainty resolution versus a learned performance?

Key question addressed by the task difficulty analysis comparing MMLU and GPQA-Diamond

Source paper

extracted_from
Reasoning Theater: Disentangling Model Beliefs from Chain-of-Thought
(2026) · Siddharth Boppana · Annabel Ma · Max Loeffler · Raphaël Sarfati +4

Neighborhood — ranked by edge-count

Claims (1)

claim

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.