paper
referenced-only
2024
paper:arxiv-2402-03300DeepSeekMath: Pushing the limits of mathematical reasoning in open language models
ByZhihong Shao·Peiyi Wang·Qihao Zhu·Runxin Xu·Junxiao Song·Xiao Bi+4 more
Related work— refs + corpus + external arXiv
Cited / in-corpus / arXiv badges show which signals surfaced each row. Multi-source rows weighted higher.
- A Pattern Language for Machine Learning TasksIan Fan, Tuomas Laakkonen, Neil John Ortega, Thomas Hoffmann, Vincent Wang-Mascianica Benjamin Rodatz2025≈ 78%
- An Automated Survey of Generative Artificial Intelligence: Large Language Models, Architectures, Protocols, and Applications\'Alvaro L\'opez L\'opez Eduardo C. Garrido-Merch\'an2026≈ 77%
- StreetMath: Study of LLMs' Approximation BehaviorsSomshubhra Roy, Maisha Thasin, Danyang Zhang, and Blessing Effiong Chiung-Yi Tseng2025≈ 77%
- ≈ 76%
- Artificial Intelligence and the Structure of MathematicsMichael R. Douglas, Michael H. Freedman Maissam Barkeshli2026≈ 76%
- Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-ThoughtAbulhair Saparov and He He2023≈ 76%
- ≈ 76%
- A geometric relation of the error introduced by sampling a language model's output distribution to its internal stateAlbert F. Modenbach2026≈ 75%
- A Survey of Theory of Mind in Large Language Models: Evaluations, Representations, and Safety RisksHieu Minh "Jord" Nguyen2025≈ 75%
- PersuasiveToM: A Benchmark for Evaluating Machine Theory of Mind in Persuasive DialoguesLai Jiang, Shenyi Huang, Zhen Wu, Xinyu Dai Fangxu Yu2025≈ 75%
- Reasoning with Language Model is Planning with World ModelYi Gu, Haodi Ma, Joshua Jiahua Hong, Zhen Wang, Daisy Zhe Wang, Zhiting Hu Shibo Hao2023≈ 75%
- Comprehension Without Competence: Architectural Limits of LLMs in Symbolic Computation and ReasoningZheng Zhang2025≈ 75%
- Probing LLM World Models: Enhancing Guesstimation with Wisdom of Crowds DecodingSameer Narendran, Nikunj Harlalka, Alexander Cheung, Sizhe Gao, Siddharth Suresh, Junjie Hu, Timothy T. Rogers Yun-Shiuan Chuang2025≈ 75%
- How FaR Are Large Language Models From Agents with Theory-of-Mind?Aman Madaan, Srividya Pranavi Potharaju, Aditya Gupta, Kevin R. McKee, Ari Holtzman, Jay Pujara, Xiang Ren, Swaroop Mishra, Aida Nematzadeh, Shyam Upadhyay, Manaal Faruqui Pei Zhou2023≈ 75%
- Do Language Models Follow Occam's Razor? An Evaluation of Parsimony in Inductive and Abductive ReasoningAbulhair Saparov Yunxin Sun2026≈ 75%
- ≈ 73%
- Opening the Hood of a Word Processorin corpus1984≈ 73%
- ≈ 73%
- ≈ 72%
- ≈ 72%
- ≈ 72%
- ≈ 72%
- ≈ 72%
- ≈ 72%
- Contemplative Agentin corpus2025≈ 71%
- ≈ 71%
- Active Inference, Curiosity and Insightin corpus2017≈ 71%
- The computational boundary of a 'self': developmental bioelectricity drives multicellularity and scale-free cognitionin corpus2019≈ 71%
- ≈ 71%
- The Platonic Representation Hypothesisin corpus2024≈ 71%
Similar preprints — Semantic Scholar
Cited by (2)
- SFR-DeepResearch: Towards Effective Reinforcement Learning for Autonomously Reasoning Single Agents
Continual reinforcement learning applied directly to reasoning-optimized base models—rather than starting from instruction-tuned checkpoints—yields a 20-parameter-billion autonomous single-agent, SFR-
- ATLAS: Agentic or Latent Visual Reasoning? One Word is Enough for Both
ATLAS resolves a core trade-off in visual reasoning by introducing functional tokens — single discrete 'words' that simultaneously serve as agentic operations and latent visual reasoning units, elimin