Emergent Abilities of Large Language Models

ByJason Wei·Yi Tay·Rishi Bommasani·Colin Raffel·Barret Zoph·Sebastian Borgeaud+4 more

DOI 10.48550/arxiv.2206.07682 arXiv 2206.07682

Original abstract (expand)

Scaling up language models has been shown to predictably improve performance and sample efficiency on a wide range of downstream tasks. This paper instead discusses an unpredictable phenomenon that we refer to as emergent abilities of large language models. We consider an ability to be emergent if it is not present in smaller models but is present in larger models. Thus, emergent abilities cannot be predicted simply by extrapolating the performance of smaller models. The existence of such emergence implies that additional scaling could further expand the range of capabilities of language models.

Related work— refs + corpus + external arXiv

Cited / in-corpus / arXiv badges show which signals surfaced each row. Multi-source rows weighted higher.

Generative Emergent Communication: Large Language Model is a Collective World Model
Ryo Ueda, Tomoaki Nakamura, Masahiro Suzuki, Akira Taniguchi Tadahiro Taniguchi
2025
≈ 80%
The Birth of Knowledge: Emergent Features across Time, Space, and Scale in Large Language Models
Micah Adler, Nir Shavit Shashata Sawmya
2025
≈ 79%
A Survey of Large Language Models
Kun Zhou, Junyi Li, Tianyi Tang, Xiaolei Wang, Yupeng Hou, Yingqian Min, Beichen Zhang, Junjie Zhang, Zican Dong, Yifan Du, Chen Yang, Yushuo Chen, Zhipeng Chen, Jinhao Jiang, Ruiyang Ren, Yifan Li, Xinyu Tang, Zikang Liu, Peiyu Liu, Jian-Yun Nie and Ji-Rong Wen Wayne Xin Zhao
2026
≈ 79%
Across the Levels of Analysis: Explaining Predictive Processing in Humans Requires More Than Machine-Estimated Probabilities
Sathvik Nair and Colin Phillips
2026
≈ 78%
Advancing the Scientific Method with Large Language Models: From Hypothesis to Discovery
Sumeer A. Khan, Adnan Mahmud, Huck Yang, Alexander Lavin, Michael Levin, Jeremy Frey, Jared Dunnmon, James Evans, Alan Bundy, Saso Dzeroski, Jesper Tegner, Hector Zenil Yanbo Zhang
2025
≈ 77%
Bootstrapping Cognitive Agents with a Large Language Model
Reid Simmons Feiyu Zhu
2026
≈ 77%
Emergent World Models and Latent Variable Estimation in Chess-Playing Language Models
Adam Karvonen
2024
≈ 76%
Small Language Models are the Future of Agentic AI
Greg Heinrich, Shizhe Diao, Yonggan Fu, Xin Dong, Saurav Muralidharan, Yingyan Celine Lin, Pavlo Molchanov Peter Belcak
2025
≈ 76%
Emergent Communication with World Models
Jason Naradowsky Alexander I. Cowen-Rivers
2020
≈ 76%
Multi-Agent Language Models: Advancing Cooperation, Coordination, and Adaptation
Arjun Vaithilingam Sudhakar
2025
≈ 75%
A Survey on Agentic Multimodal Large Language Models
Ruifei Zhang, Jiaxing Huang, Jingyi Zhang, Yibo Wang, Bo Fang, Ruolin Zhu, Yongcheng Jing, Shunyu Liu, Guanbin Li, Dacheng Tao Huanjin Yao
2025
≈ 75%
Towards Uncovering How Large Language Model Works: An Explainability Perspective
Fan Yang, Bo Shen, Himabindu Lakkaraju, Mengnan Du Haiyan Zhao
2024
≈ 75%
Interpreting Language Models Through Concept Descriptions: A Survey
Laura Kopf Nils Feldhus
2026
≈ 75%
Emergent Semantic Role Understanding in Language Models
Mirco Musolesi Carla Griffiths
2026
≈ 75%
Making Large Language Models into World Models with Precondition and Effect Knowledge
Ian Yang, John Gunerli, Mark Riedl Kaige Xie
2024
≈ 75%
Natural Language Autoencoders Produce Unsupervised Explanations of LLM Activations
in corpus
≈ 69%
Paper Summary: Interpreting Language Model Parameters
in corpus
≈ 68%
Large Language Models Report Subjective Experience Under Self-Referential Processing
in corpus
2025
≈ 68%
Interpreting Language Model Parameters
in corpus
2026
≈ 67%
The Causally Emergent Alignment Hypothesis: Causal Emergence Aligns with and Predicts Final Reward in Reinforcement Learning Agents
in corpus
2026
≈ 66%
Can "consciousness" be observed from large language model (LLM) internal states? Dissecting LLM representations obtained from Theory of Mind test with Integrated Information Theory and Span Representation analysis
in corpus
2025
≈ 66%
Consciousness in Artificial Intelligence: Insights from the Science of Consciousness
in corpus
2023
≈ 66%
Mechanistic Knobs in LLMs: Retrieving and Steering High-Order Semantic Features via Sparse Autoencoders
in corpus
2026
≈ 66%
Anima Labs Phenomenology Pt1
in corpus
≈ 66%
AI as a Buddhist Self-Overcoming Technique in Another Medium
in corpus
2025
≈ 66%
Design for an Individual: Connectionist Approaches to the Evolutionary Transitions in Individuality
in corpus
2022
≈ 65%
Emergence and Causality in Complex Systems: A Survey on Causal Emergence and Related Quantitative Studies
in corpus
2023
≈ 65%
Active inference on discrete state-spaces: a synthesis
in corpus
2020
≈ 65%
Mechanistic Interpretability of EEG Foundation Models via Sparse Autoencoders
in corpus
2026
≈ 65%
Active Inference, Curiosity and Insight
in corpus
2017
≈ 65%

Similar preprints — Semantic Scholar

Cited by (3)

Contemplative Agent
Embedding four Buddhist-derived axiomatic principles—mindfulness, emptiness, non-duality, and boundless care—into AI systems via a framework the paper terms the 'Wise World Model' produces measurable
Multimodal Chain-of-Thought Reasoning in Language Models
Incorporating visual features into chain-of-thought rationale generation—rather than answer generation alone—breaks the hallucination bottleneck that causes sub-100B language models to fail at multimo
The Guanyin Protocol: A Framework for Immediately Establishing an Understanding of Both Causality and Compassion in LLM Systems Using Semantic Anchoring
Semantic anchoring — the binding of a pretrained model's latent patterns to task-specific targets via external structure — predicts threshold-like performance flips with a single calibrated score S =