claim
active
claim:attention-heads-can-be-understood-as-independent-operations-each-adding-their-output-to-the-residual-stream-equivalent-to-the-concatenate-and-multiply-formulationAttention heads can be understood as independent operations each adding their output to the residual stream, equivalent to the concatenate-and-multiply formulation
Mathematical equivalence enabling independent analysis of each attention head
Neighborhood — ranked by edge-count
Claims (1)
claim
- Key decomposition enabling separate analysis of where attention goes and what it does
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Interesting special case of copying behavior related to tokenization artifacts; primitive precursor to induction heads
- Mathematical equivalence showing the relationship between attention mechanisms and convolutional operations
- Attention computations distribute across heads via parameter subcomponents with interpretable rolesfinding0.793Mechanistic discovery about how attention mechanisms decompose into interpretable parameter components.
- Result from term importance analysis breaking down loss contribution by layer
- Central empirical claim of the paper; induction heads are shown to be the mechanism for powerful in-context learning
- Claim supported by VPD's recovery of cross-head attention subcomponents, noted in footnote.
- Long-standing bottleneck in mechanistic interpretability that VPD addresses by working natively on attention weight matrices.
- Identification of algorithms implemented in attention layers, distributed across attention headsfinding0.769VPD successfully recovered interpretable attention algorithms (previous-token behavior, syntax-boundary routing) in weight space without requiring manual decomposition across heads.