claim
active
claim:naive-interpretation-of-attention-patterns-can-be-both-informative-and-fundamentally-misleading-when-q-k-or-v-composition-is-present
Naive interpretation of attention patterns can be both informative and fundamentally misleading when Q-, K-, or V-composition is present
Response to the 'attention as explanation' critique; the paper provides a typology of when attention is and isn't directly interpretable
Neighborhood — ranked by edge-count
Papers (1)
paper
Claims (1)
claim
- Key decomposition enabling separate analysis of where attention goes and what it does
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Result from applying the Frobenius norm composition measurement to all attention head pairs in the two-layer model
- Finding from term importance analysis; allows focus on individual head terms rather than their compositions
- Suggests that LLMs do not represent complement/MSV linguistic features in the same way humans do, although these features are crucial for human ToM development.
- Load-bearing epistemic caution the author places on the entire analytical framework.
- Call to extend the inference of sentience to non-biological systems as well.
- Canonical illustration of the Hard Problem intuition that any functional/mechanical explanation faces an explanatory gap for perception
- Mathematical equivalence showing the relationship between attention mechanisms and convolutional operations