paper
referenced-only
2021
paper:arxiv-2110-14168Training verifiers to solve math word problems
ByK. Cobbe·V. Kosaraju·M. Bavarian·M. Chen·H. Jun·L. Kaiser+4 more
Related work— refs + corpus + external arXiv
Cited / in-corpus / arXiv badges show which signals surfaced each row. Multi-source rows weighted higher.
- Teaching for large-scale Reproducibility VerificationLars Vilhuber and Hyuk Harry Son and Meredith Welch and David N. Wasser and Michael Darisse2026≈ 71%
- ≈ 69%
- E-valuator: Reliable Agent Verifiers with Sequential Hypothesis TestingDrew Prinster, Clara Fannjiang, Gabriele Scalia, Aviv Regev, Hanchen Wang Shuvom Sadhuka2025≈ 68%
- DataDignity: Training Data Attribution for Large Language ModelsAndrzej Banburski-Fahey, Jaron Lanier Xiaomin Li2026≈ 68%
- A Pattern Language for Machine Learning TasksIan Fan, Tuomas Laakkonen, Neil John Ortega, Thomas Hoffmann, Vincent Wang-Mascianica Benjamin Rodatz2025≈ 67%
- Designing Perceptual Puzzles by Differentiating Probabilistic ProgramsTzu-Mao Li, Joshua Tenenbaum, Jonathan Ragan-Kelley Kartik Chandra2022≈ 67%
- AI co-mathematician: Accelerating mathematicians with agentic AIIngrid von Glehn, Yori Zwols, Iuliya Beloshapka, Lars Buesing, Daniel M. Roy, Martin Wattenberg, Bogdan Georgiev, Tatiana Schmidt, Andrew Cowie, Fernanda Viegas, Dimitri Kanevsky, Vineet Kahlon, Hartmut Maennel, Sophia Alj, George Holland, Alex Davies, Pushmeet Kohli Daniel Zheng2026≈ 67%
- Failing to Falsify: Evaluating and Mitigating Confirmation Bias in Language ModelsAnthony GX-Chen, Ilia Sucholutsky, Eunsol Choi Ayush Rajesh Jhaveri2026≈ 67%
- ≈ 66%
- Measuring What Matters: Benchmarking Generative, Multimodal, and Agentic AI in HealthcareHarshit Rajgarhia, Shivali Dalmia, Ananya Mantravadi Prasanna Desikan2026≈ 66%
- When Should Users Check? Modeling Confirmation Frequency inMulti-Step Agentic AI TasksAryan Roy, Sneh Gupta, Daniel Weitekamp, Christopher J. MacLellan Jieyu Zhou2026≈ 66%
- When Respondents Don't Care Anymore: Identifying the Onset of Careless RespondingMax Welz and Andreas Alfons2026≈ 66%
- ≈ 66%
- Learning to Model the World with LanguageYuqing Du, Olivia Watkins, Danijar Hafner, Pieter Abbeel, Dan Klein, Anca Dragan Jessy Lin2024≈ 66%
- Hint Tuning: Less Data Makes Better ReasonersMinghao Li, Xiaoqian Ma, Xiusheng Huang, Zhuo Chen, Bowen Qin, Liujie Zhang, Shuo Shang, Weihang Chen Siqi Fan2026≈ 66%
- Testing the Limits of Truth Directions in LLMsin corpus2026≈ 63%
- ≈ 62%
- Learning without neurons in physical systemsin corpus2022≈ 62%
- Probe-Based Data Attribution: Surfacing and Mitigating Undesirable Behaviors in LLM Post-Trainingin corpus2026≈ 62%
- Interpreting Language Model Parametersin corpus2026≈ 62%
- ≈ 61%
- The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasetsin corpus2023≈ 61%
- ≈ 61%
- Verbalized Eval Awareness Inflates Measured Safetyin corpus2026≈ 61%
- ≈ 61%
- ≈ 60%
- A Mathematical Framework for Transformer Circuitsin corpus2021≈ 60%
- ≈ 60%
- Opening the Hood of a Word Processorin corpus1984≈ 60%
Similar preprints — Semantic Scholar
Cited by (2)
- The Platonic Representation Hypothesis
Neural networks trained on different data modalities, architectures, and objectives are converging toward a shared statistical model of reality — what the paper terms the "platonic representation" — f
- Probe-Based Data Attribution: Surfacing and Mitigating Undesirable Behaviors in LLM Post-Training
Probe-based data attribution, introduced here as a method for surfacing and mitigating undesirable post-training behaviors, reduces harmful compliance in OLMo 2 7B by 63% through datapoint filtering a