Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?

BySewon Min·Xinxi Lyu·Ari Holtzman·Mikel Artetxe·M. Lewis·Hannaneh Hajishirzi+1 more

DOI 10.18653/v1/2022.emnlp-main.759 arXiv 2202.12837

Original abstract (expand)

Large language models (LMs) are able to in-context learn—perform a new task via inference alone by conditioning on a few input-label pairs (demonstrations) and making predictions for new inputs. However, there has been little understanding of how the model learns and which aspects of the demonstrations contribute to end task performance. In this paper, we show that ground truth demonstrations are in fact not required—randomly replacing labels in the demonstrations barely hurts performance on a range of classification and multi-choce tasks, consistently over 12 different models including GPT-3. Instead, we find that other aspects of the demonstrations are the key drivers of endtask performance, including the fact that they provide a few examples of (1) the label space, (2) the distribution of the input text, and (3) the overall format of the sequence. Together, our analysis provides a new way of understanding how and why in-context learning works, while opening up new questions about how much can be learned from large language models through inference alone.

Related work— refs + corpus + external arXiv

Cited / in-corpus / arXiv badges show which signals surfaced each row. Multi-source rows weighted higher.

The Mystery of In-Context Learning: A Comprehensive Survey on Interpretation and Analysis
Jiazheng Li, Yanzheng Xiang, Hanqi Yan, Lin Gui, Yulan He Yuxiang Zhou
2024
≈ 77%
Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale
Karthik Gopalakrishnan, Saket Dingliwal, Sravan Bodapati, Katrin Kirchhoff, Dan Roth Hritik Bansal
2023
≈ 75%
Emergent Misalignment via In-Context Learning: Narrow in-context examples can produce broadly misaligned LLMs
Nikita Andriianov, Vahagn Hovhannisyan, Nikhil Bageshpura, Kyle Liu, Kevin Zhu, Sunishchal Dev, Ashwinee Panda, Oleg Rogov, Elena Tutubalina, Alexander Panchenko, Mikhail Seleznyov Nikita Afonin
2026
≈ 75%
Procedure Planning in Instructional Videos via Contextual Modeling and Model-based Policy Learning
Jiebo Luo, Chenliang Xu Jing Bi
2021
≈ 74%
Illusion or Algorithm? Investigating Memorization, Emergence, and Symbolic Processing in In-Context Learning
Subhabrata Dutta, Ahmed Elshabrawy, Harish Tayyar Madabushi, Iryna Gurevych Jingcheng Niu
2025
≈ 74%
Crafting a Good Prompt or Providing Exemplary Dialogues? A Study of In-Context Learning for Persona-based Dialogue Generation
Yajing Wan, Yuru Zhang, Jing Chen, Ling Cheng, Qian Shao, Yongzhu Chang, Tangjie Lv, Rongsheng Zhang Jiashu Pu
2024
≈ 74%
Differential learning kinetics govern the transition from memorization to generalization during in-context learning
Gautam Reddy Alex Nguyen
2024
≈ 74%
Context and Diversity Matter: The Emergence of In-Context Learning in World Models
Zhiyuan Chen, Yuxuan Zhong, Sunjian Zheng, Pengtao Shao, Bo Yu, Shaoshan Liu, Jianan Wang, Ning Ding, Yang Cao and Yu Kang Fan Wang
2026
≈ 74%
Next-token pretraining implies in-context learning
Paul M. Riechers and Henry R. Bigelow and Eric A. Alt and Adam Shai
2025
≈ 73%
The mechanistic basis of data dependence and abrupt learning in an in-context classification task
Gautam Reddy
2023
≈ 73%
How Interest-Driven Content Creation Shapes Opportunities for Informal Learning in Scratch: A Case Study on Novices' Use of Data Structures
Sayamindu Dasgupta, Benjamin Mako Hill Ruijia Cheng
2026
≈ 73%
On Meta-Prompting
Xun Wang, Qilong Gu, Si-Qing Chen Adrian de Wynter
2026
≈ 72%
Emergent social transmission of model-based representations without inference
Miriam Bautista-Salinero, Claudio Tennie and Charley M. Wu Silja Ke{\ss}ler
2026
≈ 72%
Evaluation Framework for Highlight Explanations of Context Utilisation in Language Models
Pepa Atanasova, Sagnik Ray Choudhury, Sekh Mainul Islam, Isabelle Augenstein Jingyi Sun
2026
≈ 72%
Enhancing Multimodal In-Context Learning via Inductive-Deductive Reasoning
Haonan Wang, Yuyan Chen, Jun Chen, Gang Liu, Qian Wang, Jiahong Yan, Yanghua Xiao Haoyu Wang
2026
≈ 72%
Why Learning Requires Feeling
in corpus
2026
≈ 69%
Reasoning Theater: Disentangling Model Beliefs from Chain-of-Thought
in corpus
2026
≈ 67%
Verbalized Eval Awareness Inflates Measured Safety
in corpus
2026
≈ 66%
Simulators — LessWrong
in corpus
≈ 66%
Learning without neurons in physical systems
in corpus
2022
≈ 66%
Multiple ways to implement and infer sentience
in corpus
≈ 65%
Steering Along Manifolds to Control Neural Networks
in corpus
≈ 65%
Anima Labs Phenomenology Pt1
in corpus
≈ 65%
A Free energy principle for the brain (lecture summary)
in corpus
2008
≈ 65%
Alignment faking in large language models
in corpus
2024
≈ 65%
The World Inside Neural Networks
in corpus
2026
≈ 65%
The Platonic Representation Hypothesis
in corpus
2024
≈ 65%
Probe-Based Data Attribution: Surfacing and Mitigating Undesirable Behaviors in LLM Post-Training
in corpus
2026
≈ 65%
Brains and where else? Mapping theories of consciousness to unconventional embodiments
in corpus
2026
≈ 65%
Active inference on discrete state-spaces: a synthesis
in corpus
2020
≈ 64%

Similar preprints — Semantic Scholar

Cited by (1)

The Guanyin Protocol: A Framework for Immediately Establishing an Understanding of Both Causality and Compassion in LLM Systems Using Semantic Anchoring
Semantic anchoring — the binding of a pretrained model's latent patterns to task-specific targets via external structure — predicts threshold-like performance flips with a single calibrated score S =