claim
active
claim:induction-heads-explain-in-context-learning-in-small-models-and-only-develop-in-models-with-at-least-two-attention-layersInduction heads explain in-context learning in small models and only develop in models with at least two attention layers
Central empirical claim of the paper; induction heads are shown to be the mechanism for powerful in-context learning
Neighborhood — ranked by edge-count
Findings (2)
finding
- Quantitative verification of the mechanistic theory; both circuits required for the induction algorithm show the predicted copying/matching structure
- Strong test of the induction head hypothesis using uniformly sampled random tokens repeated three times
Hypotheses (1)
hypothesis
- Central motivating hypothesis for the forthcoming paper on in-context learning and induction heads
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- A follow-up paper extending the framework and induction head concept to larger more realistic models
- Forward-looking claim connecting toy model findings to large-scale language models
- The mechanistic explanation of how induction heads are implemented in two-layer models
- Concrete example from examining expanded QK/OV matrices showing how specific programming language structure is encoded in attention weights
- Empirical observation from examining expanded OV/QK matrices; approximately 10 out of 12 heads show significant copying
- Mathematical equivalence enabling independent analysis of each attention head
- Describes scaffolding method and the model's meta-learning loop.
- Result from term importance analysis breaking down loss contribution by layer