question
active
question:what-matrix-decomposition-or-dimensionality-reduction-best-summarizes-the-enormous-low-rank-ov-and-qk-matrices

What matrix decomposition or dimensionality reduction best summarizes the enormous low-rank OV and QK matrices?

Open methodological question about converting the 50k x 50k expanded matrices into human-graspable summaries

Source paper

extracted_from
A Mathematical Framework for Transformer Circuits
(2021) ·

Neighborhood — ranked by edge-count

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.