Competition Dynamics Shape Algorithmic Phases of In-Context Learning

ByCore Francisco Park·E. Lubana·I. Pres·Hidenori Tanaka

DOI 10.48550/arxiv.2412.01003 arXiv 2412.01003

Original abstract (expand)

In-Context Learning (ICL) has significantly expanded the general-purpose nature of large language models, allowing them to adapt to novel tasks using merely the inputted context. This has motivated a series of papers that analyze tractable synthetic domains and postulate precise mechanisms that may underlie ICL. However, the use of relatively distinct setups that often lack a sequence modeling nature to them makes it unclear how general the reported insights from such studies are. Motivated by this, we propose a synthetic sequence modeling task that involves learning to simulate a finite mixture of Markov chains. As we show, models trained on this task reproduce most well-known results on ICL, hence offering a unified setting for studying the concept. Building on this setup, we demonstrate we can explain a model's behavior by decomposing it into four broad algorithms that combine a fuzzy retrieval vs. inference approach with either unigram or bigram statistics of the context. These algorithms engage in a competition dynamics to dominate model behavior, with the precise experimental conditions dictating which algorithm ends up superseding others: e.g., we find merely varying context size or amount of training yields (at times sharp) transitions between which algorithm dictates the model behavior, revealing a mechanism that explains the transient nature of ICL. In this sense, we argue ICL is best thought of as a mixture of different algorithms, each with its own peculiarities, instead of a monolithic capability. This also implies that making general claims about ICL that hold universally across all settings may be infeasible.

Related work— refs + corpus + external arXiv

Cited / in-corpus / arXiv badges show which signals surfaced each row. Multi-source rows weighted higher.

The mechanistic basis of data dependence and abrupt learning in an in-context classification task
Gautam Reddy
2023
≈ 78%
Context and Diversity Matter: The Emergence of In-Context Learning in World Models
Zhiyuan Chen, Yuxuan Zhong, Sunjian Zheng, Pengtao Shao, Bo Yu, Shaoshan Liu, Jianan Wang, Ning Ding, Yang Cao and Yu Kang Fan Wang
2026
≈ 77%
Strategy Coopetition Explains the Emergence and Transience of In-Context Learning
Ted Moskovitz, Sara Dragutinovic, Felix Hill, Stephanie C.Y. Chan, Andrew M. Saxe Aaditya K. Singh
2025
≈ 75%
Differential learning kinetics govern the transition from memorization to generalization during in-context learning
Gautam Reddy Alex Nguyen
2024
≈ 75%
Two Ways of Understanding Social Dynamics: Analyzing the Predictability of Emergence of Objects in Reddit r/place Dependent on Locality in Space and Time
Javier Fernandez, Olaf Witkowski Alyssa M Adams
2022
≈ 74%
Deep Latent Competition: Learning to Race Using Visual Control Policies in Latent Space
Tim Seyde, Igor Gilitschenski, Lucas Liebenwein, Ryan Sander, Sertac Karaman, Daniela Rus Wilko Schwarting
2021
≈ 74%
A Survey of Learning in Multiagent Environments: Dealing with Non-Stationarity
Michael Kaisers, Tim Baarslag and Enrique Munoz de Cote Pablo Hernandez-Leal
2019
≈ 74%
Next-token pretraining implies in-context learning
Paul M. Riechers and Henry R. Bigelow and Eric A. Alt and Adam Shai
2025
≈ 74%
COMBAT: Conditional World Models for Behavioral Agent Training
Pranay Meshram, Sumer Singh, Saurav Suman, Andrew Lapp, Shahbuland Matiana, Louis Castricato, Spencer Frazier Anmol Agarwal
2026
≈ 73%
AI, Meet Human: Learning Paradigms for Hybrid Decision Making Systems
Roberto Pellungrini, Mattia Setzu, Fosca Giannotti and Dino Pedreschi Clara Punzi
2026
≈ 73%
Compete and Compose: Learning Independent Mechanisms for Modular World Models
Frederik Nolte, Bernhard Sch\"olkopf, Ingmar Posner Anson Lei
2024
≈ 73%
Illusion or Algorithm? Investigating Memorization, Emergence, and Symbolic Processing in In-Context Learning
Subhabrata Dutta, Ahmed Elshabrawy, Harish Tayyar Madabushi, Iryna Gurevych Jingcheng Niu
2025
≈ 73%
Human strategic decision making in parametrized games
Sam Ganzfried
2026
≈ 73%
Episodic Memory for Learning Subjective-Timescale Models
Matthew Crosby, Zafeirios Fountas Alexey Zakharov
2020
≈ 73%
Learning Latent Action World Models In The Wild
Tushar Nagarajan, Basile Terver, Nicolas Ballas, Yann LeCun, Michael Rabbat Quentin Garrido
2026
≈ 73%
Learning without neurons in physical systems
in corpus
2022
≈ 71%
2022-09-23_Prabros._dynamics-in-action-pdf1.pdf_2f6a2b
in corpus
≈ 71%
Active inference on discrete state-spaces: a synthesis
in corpus
2020
≈ 70%
The Causally Emergent Alignment Hypothesis: Causal Emergence Aligns with and Predicts Final Reward in Reinforcement Learning Agents
in corpus
2026
≈ 69%
Mechanistic Knobs in LLMs: Retrieving and Steering High-Order Semantic Features via Sparse Autoencoders
in corpus
2026
≈ 68%
Active Inference: A Process Theory
in corpus
2017
≈ 68%
Topological constraints on self-organization in locally interacting systems
in corpus
2026
≈ 68%
The Guanyin Protocol: A Framework for Immediately Establishing an Understanding of Both Causality and Compassion in LLM Systems Using Semantic Anchoring
in corpus
2025
≈ 68%
Emergence and Causality in Complex Systems: A Survey on Causal Emergence and Related Quantitative Studies
in corpus
2023
≈ 68%
An association-based model of dynamic behaviour
in corpus
2011
≈ 68%
Active inference: demystified and compared
in corpus
2021
≈ 68%
A Free energy principle for the brain (lecture summary)
in corpus
2008
≈ 67%
Information, Processes and Games
in corpus
≈ 67%
The World Inside Neural Networks
in corpus
2026
≈ 67%
A tale of two densities: active inference is enactive inference
in corpus
2020
≈ 67%

Similar preprints — Semantic Scholar

Cited by (1)

The Guanyin Protocol: A Framework for Immediately Establishing an Understanding of Both Causality and Compassion in LLM Systems Using Semantic Anchoring
Semantic anchoring — the binding of a pretrained model's latent patterns to task-specific targets via external structure — predicts threshold-like performance flips with a single calibrated score S =