finding
active
finding:probe-guided-early-exit-reduces-tokens-by-up-to-30-on-gpqa-diamond-with-similar-accuracy-on-deepseek-r1-671b-and-gpt-oss-120b

Probe-guided early exit reduces tokens by up to 30% on GPQA-Diamond with similar accuracy on DeepSeek-R1 671B and GPT-OSS 120B

Quantitative efficiency result on a hard benchmark; the comparatively small token reduction reflects a genuine need for reasoning on these questions

Source paper

extracted_from
Reasoning Theater: Disentangling Model Beliefs from Chain-of-Thought
(2026) · Siddharth Boppana · Annabel Ma · Max Loeffler · Raphaël Sarfati +4

Neighborhood — ranked by edge-count

Claims (1)

claim

Hypotheses (1)

hypothesis

Questions (1)

question

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Restated by (1)

cosine ≥ 0.90

Other entities that say roughly the same thing. May be merge candidates or independent restatements across papers.