finding
active
finding:induction-heads-in-two-layer-models-successfully-perform-in-context-learning-on-completely-random-repeated-token-sequences-far-outside-training-distributionInduction heads in two-layer models successfully perform in-context learning on completely random repeated token sequences far outside training distribution
Strong test of the induction head hypothesis using uniformly sampled random tokens repeated three times
Neighborhood — ranked by edge-count
Claims (3)
claim
- Central empirical claim of the paper; induction heads are shown to be the mechanism for powerful in-context learning
- The mechanistic explanation of how induction heads are implemented in two-layer models
- Core claim for two-layer models; composition creates qualitatively more powerful in-context learning
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Forward-looking claim connecting toy model findings to large-scale language models
- A follow-up paper extending the framework and induction head concept to larger more realistic models
- Quantitative verification of the mechanistic theory; both circuits required for the induction algorithm show the predicted copying/matching structure
- Striking mechanistic finding that injection creates universally detectable perturbation in residual stream immediately downstream
- Controls for dataset structure, showing trained model activations have richer structure than data distribution alone
- Concrete example from examining expanded QK/OV matrices showing how specific programming language structure is encoded in attention weights
- Hypothesis tested in Experiment 3; independently trained GPT, Claude, Gemini architectures converge on similar descriptive vocabulary
- Token-level supervision enables models to learn functional-token invocation from reasoning contextclaim0.742ATLAS author's assertion that functional tokens optimized via standard cross-entropy loss learn when and how to invoke operations from surrounding text.