concept
active
concept:in-context-learning-and-induction-heads-forthcoming-paperIn-Context Learning and Induction Heads (forthcoming paper)
A follow-up paper extending the framework and induction head concept to larger more realistic models
Neighborhood — ranked by edge-count
Papers (2)
paper
Hypotheses (1)
hypothesis
- The mathematical framework and induction head concept will remain at least partially relevant for larger, more realistic modelsassociated_withCentral motivating hypothesis for the forthcoming paper on in-context learning and induction heads
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Central empirical claim of the paper; induction heads are shown to be the mechanism for powerful in-context learning
- Reports phase-like breakpoints and geometry changes as context scales; UCCT provides measurable predictor
- Forward-looking claim connecting toy model findings to large-scale language models
- Strong test of the induction head hypothesis using uniformly sampled random tokens repeated three times
- Test-time adaptation from prompt or retrieved context with no parameter updates.
- Mechanistic circuits in transformers documented by Olsson et al. 2022, cited as evidence for pattern-repository assumption
- The mechanistic explanation of how induction heads are implemented in two-layer models