claim
active
claim:attention-is-a-generalization-of-convolution-all-convolutions-can-be-expressed-as-tensor-products-of-fixed-relative-position-attention-patterns-and-weight-matricesAttention is a generalization of convolution; all convolutions can be expressed as tensor products of fixed relative position attention patterns and weight matrices
Mathematical equivalence showing the relationship between attention mechanisms and convolutional operations
Neighborhood — ranked by edge-count
Thinkers (1)
thinker
- Cordonnier et al.supportsExplored the equivalence between attention and convolution and empirically found that vision models often have many 2D relative position heads
Concepts (1)
concept
- Virtual Attention HeadsupportsThe composition of two attention heads via V-composition, forming a new entity with its own attention pattern A^h2 * A^h1 and OV matrix W_OV^h2 * W_OV^h1
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Mathematical equivalence enabling independent analysis of each attention head
- Claim supported by VPD's recovery of cross-head attention subcomponents, noted in footnote.
- Paper's operational description of consciousness as conductor — mechanism for the coherence definition
- The Attention Schema Theory: A Foundation for Engineering Artificial Consciousness (Graziano, 2017)concept0.782Paper providing the biological framework analogy for ESR as a form of attentional control
- Identification of algorithms implemented in attention layers, distributed across attention headsfinding0.781VPD successfully recovered interpretable attention algorithms (previous-token behavior, syntax-boundary routing) in weight space without requiring manual decomposition across heads.
- Justifies the methodological choice of attention over concatenation, mean pooling, residual connections, or joint embedding.
- Precision is the unified mechanism for attention and meta-awareness across hierarchical levels.claim0.774Core theoretical claim: attention = precision over sensory evidence; meta-awareness = precision over attentional states; meditation = precision modulation.
- Response to the 'attention as explanation' critique; the paper provides a typology of when attention is and isn't directly interpretable