paper:doi-10-48550-arxiv-2206-07682Emergent Abilities of Large Language Models
Original abstract (expand)
Scaling up language models has been shown to predictably improve performance and sample efficiency on a wide range of downstream tasks. This paper instead discusses an unpredictable phenomenon that we refer to as emergent abilities of large language models. We consider an ability to be emergent if it is not present in smaller models but is present in larger models. Thus, emergent abilities cannot be predicted simply by extrapolating the performance of smaller models. The existence of such emergence implies that additional scaling could further expand the range of capabilities of language models.
Related work— refs + corpus + external arXiv
Cited / in-corpus / arXiv badges show which signals surfaced each row. Multi-source rows weighted higher.
- Generative Emergent Communication: Large Language Model is a Collective World ModelRyo Ueda, Tomoaki Nakamura, Masahiro Suzuki, Akira Taniguchi Tadahiro Taniguchi2025≈ 80%
- The Birth of Knowledge: Emergent Features across Time, Space, and Scale in Large Language ModelsMicah Adler, Nir Shavit Shashata Sawmya2025≈ 79%
- A Survey of Large Language ModelsKun Zhou, Junyi Li, Tianyi Tang, Xiaolei Wang, Yupeng Hou, Yingqian Min, Beichen Zhang, Junjie Zhang, Zican Dong, Yifan Du, Chen Yang, Yushuo Chen, Zhipeng Chen, Jinhao Jiang, Ruiyang Ren, Yifan Li, Xinyu Tang, Zikang Liu, Peiyu Liu, Jian-Yun Nie and Ji-Rong Wen Wayne Xin Zhao2026≈ 79%
- Across the Levels of Analysis: Explaining Predictive Processing in Humans Requires More Than Machine-Estimated ProbabilitiesSathvik Nair and Colin Phillips2026≈ 78%
- Advancing the Scientific Method with Large Language Models: From Hypothesis to DiscoverySumeer A. Khan, Adnan Mahmud, Huck Yang, Alexander Lavin, Michael Levin, Jeremy Frey, Jared Dunnmon, James Evans, Alan Bundy, Saso Dzeroski, Jesper Tegner, Hector Zenil Yanbo Zhang2025≈ 77%
- ≈ 77%
- Emergent World Models and Latent Variable Estimation in Chess-Playing Language ModelsAdam Karvonen2024≈ 76%
- Small Language Models are the Future of Agentic AIGreg Heinrich, Shizhe Diao, Yonggan Fu, Xin Dong, Saurav Muralidharan, Yingyan Celine Lin, Pavlo Molchanov Peter Belcak2025≈ 76%
- ≈ 76%
- Multi-Agent Language Models: Advancing Cooperation, Coordination, and AdaptationArjun Vaithilingam Sudhakar2025≈ 75%
- A Survey on Agentic Multimodal Large Language ModelsRuifei Zhang, Jiaxing Huang, Jingyi Zhang, Yibo Wang, Bo Fang, Ruolin Zhu, Yongcheng Jing, Shunyu Liu, Guanbin Li, Dacheng Tao Huanjin Yao2025≈ 75%
- Towards Uncovering How Large Language Model Works: An Explainability PerspectiveFan Yang, Bo Shen, Himabindu Lakkaraju, Mengnan Du Haiyan Zhao2024≈ 75%
- ≈ 75%
- ≈ 75%
- Making Large Language Models into World Models with Precondition and Effect KnowledgeIan Yang, John Gunerli, Mark Riedl Kaige Xie2024≈ 75%
- ≈ 69%
- ≈ 68%
- ≈ 68%
- Interpreting Language Model Parametersin corpus2026≈ 67%
- ≈ 66%
- ≈ 66%
- ≈ 66%
- Mechanistic Knobs in LLMs: Retrieving and Steering High-Order Semantic Features via Sparse Autoencodersin corpus2026≈ 66%
- Anima Labs Phenomenology Pt1in corpus≈ 66%
- ≈ 66%
- Design for an Individual: Connectionist Approaches to the Evolutionary Transitions in Individualityin corpus2022≈ 65%
- Emergence and Causality in Complex Systems: A Survey on Causal Emergence and Related Quantitative Studiesin corpus2023≈ 65%
- ≈ 65%
- ≈ 65%
- Active Inference, Curiosity and Insightin corpus2017≈ 65%
Similar preprints — Semantic Scholar
Cited by (3)
- Contemplative Agent
Embedding four Buddhist-derived axiomatic principles—mindfulness, emptiness, non-duality, and boundless care—into AI systems via a framework the paper terms the 'Wise World Model' produces measurable
- Multimodal Chain-of-Thought Reasoning in Language Models
Incorporating visual features into chain-of-thought rationale generation—rather than answer generation alone—breaks the hallucination bottleneck that causes sub-100B language models to fail at multimo
- The Guanyin Protocol: A Framework for Immediately Establishing an Understanding of Both Causality and Compassion in LLM Systems Using Semantic Anchoring
Semantic anchoring — the binding of a pretrained model's latent patterns to task-specific targets via external structure — predicts threshold-like performance flips with a single calibrated score S =