finding
active
finding:some-attention-heads-partially-specialize-in-copying-for-words-that-split-into-two-tokens-without-a-space-prefix-attending-from-fragmented-token-to-complete-tokenSome attention heads partially specialize in copying for words that split into two tokens without a space prefix, attending from fragmented token to complete token
Interesting special case of copying behavior related to tokenization artifacts; primitive precursor to induction heads
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- A pair of query and key subcomponents distributed across attention heads performs previous-token behaviorfinding0.822VPD recovers an attention algorithm for attending to the previous token, distributed across multiple heads.
- Empirical observation from examining expanded OV/QK matrices; approximately 10 out of 12 heads show significant copying
- Mathematical equivalence enabling independent analysis of each attention head
- Claim supported by VPD's recovery of cross-head attention subcomponents, noted in footnote.
- Speculation about the mechanistic basis of the distinguishing thoughts from text experiment.
- Attention computations distribute across heads via parameter subcomponents with interpretable rolesfinding0.786Mechanistic discovery about how attention mechanisms decompose into interpretable parameter components.
- Concrete example from examining expanded QK/OV matrices showing how specific programming language structure is encoded in attention weights
- Interpretive claim about the mechanistic substrate of introspection in LLMs