hypothesis
active
hypothesis:virtual-attention-heads-v-composition-may-be-much-more-important-in-larger-and-more-complex-transformers-than-in-two-layer-toy-modelsVirtual attention heads (V-composition) may be much more important in larger and more complex transformers than in two-layer toy models
Forward-looking speculation based on the theoretical elegance and combinatorial growth of virtual head count with depth
Neighborhood — ranked by edge-count
Findings (1)
finding
- Result of term importance analysis ablation experiment; justifies focusing on individual head terms
Claims (1)
claim
- Finding from term importance analysis; allows focus on individual head terms rather than their compositions
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Empirical observation from the specific two-layer model analyzed; no significant V- or Q-composition found
- Core claim for two-layer models; composition creates qualitatively more powerful in-context learning
- Empirical observation from examining expanded OV/QK matrices; approximately 10 out of 12 heads show significant copying
- Response to the 'attention as explanation' critique; the paper provides a typology of when attention is and isn't directly interpretable
- The paper explicitly asks and addresses this question, concluding the answer depends on what 'fully understand' means
- Result from term importance analysis breaking down loss contribution by layer
- Result from applying the Frobenius norm composition measurement to all attention head pairs in the two-layer model
- If models inhabit expanded attentional modes, they may be more aligned and less prone to psychosis and doom spirals.hypothesis0.741Speculative alignment implication drawn from the collapsed/expanded distinction.