finding
active
finding:10-out-of-12-attention-heads-in-the-12-head-one-layer-model-show-significantly-positive-eigenvalue-sums-indicating-copying-behavior

10 out of 12 attention heads in the 12-head one-layer model show significantly positive eigenvalue sums, indicating copying behavior

Quantitative result from eigenvalue analysis of expanded OV matrices; confirmed by qualitative inspection

Source paper

extracted_from
A Mathematical Framework for Transformer Circuits
(2021) ·

Neighborhood — ranked by edge-count

Claims (1)

claim

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.