concept
active
concept:two-layer-attention-only-transformerTwo-Layer Attention-Only Transformer
The primary model analyzed; uses attention head composition, especially K-composition, to create induction heads for powerful in-context learning
Neighborhood — ranked by edge-count
Concepts (3)
concept
- Attention-Only Transformerrelated_toA simplified transformer variant without MLP layers, used as the primary subject of mechanistic analysis in this paper
- One-Layer Attention-Only Transformerrelated_toThe first toy model analyzed; shown to implement an ensemble of bigram and skip-trigram models readable directly from weights
- Induction HeadsimplementsMechanistic circuits in transformers documented by Olsson et al. 2022, cited as evidence for pattern-repository assumption
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Core claim for two-layer models; composition creates qualitatively more powerful in-context learning
- The paper explicitly asks and addresses this question, concluding the answer depends on what 'fully understand' means
- Core claim for one-layer models; the skip-trigram tables can be accessed without running the model
- Empirical observation from the specific two-layer model analyzed; no significant V- or Q-composition found
- A transformer with no attention layers; shown to model bigram statistics via T = W_U W_E
- Vivid characterization of the limits of understanding after converting to skip-trigram form: no algorithmic mystery remains but the sheer scale prevents holistic comprehension
- Antra's revision of her earlier model; still considers interference between levels important.
- Result from term importance analysis breaking down loss contribution by layer