finding
active
finding:chain-of-thought-reasoning-improves-large-model-accuracy-on-hhh-binary-comparisons-reaching-78-for-52b-model-competitive-with-human-feedback-pmChain-of-thought reasoning improves large model accuracy on HHH binary comparisons, reaching ~78% for 52B model, competitive with human-feedback PM.
Figure 4 shows CoT improves over zero-shot, and ensembled CoT further boosts accuracy.
Source paper
extracted_from(2022) · Bai, Yuntao · Saurav Kadavath · Sandipan Kundu · Amanda Askell +47
Neighborhood — ranked by edge-count
Claims (1)
claim
- CoT improves accuracy on HHH evals and makes the decision process legible.
Communities (3)
community
- Spans attention head decomposition, benchmark awareness, and genomic pathogenicity prediction via neural models.
- Probing early detection of model confidence during chain-of-thought reasoning to optimize inference efficiency and identify confabulation patterns.
- Examines whether verbalized reasoning chains reflect actual internal computation or post-hoc rationalization, using behavioral analysis and representation studies.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- A small number of high-quality human demonstrations of chain-of-thought reasoning could be used to improve and focus performance.hypothesis0.830Section 6 mentions high-quality human demos could improve natural language feedback.
- Chain-of-thought prompting elicits reasoning in large language models (Wei et al., 2022)concept0.811Foundational paper on CoT prompting cited as basis for reasoning LLM training
- Medium through which eval awareness is often verbalized; target of intervention.
- under what conditions does chain-of-thought reflect genuine uncertainty resolution versus a learned performance?question0.776Key question addressed by the task difficulty analysis comparing MMLU and GPQA-Diamond
- All models performed substantially above chance (10%) on distinguishing injected thought from text inputfinding0.762All tested models could both identify the injected concept and transcribe the input sentence well above random.
- Explanation of how knowledge (not just parameters) is shared between agents; links to pre-Cartesian consciousness
- Validates using chain-of-thought belief monitoring as proxy for behavioral steering efficacy.
- Central research question motivating the paper