finding
active
finding:decomposition-of-all-24-weight-matrices-in-a-67m-parameter-lm-yields-10-000-parameter-subcomponentsDecomposition of all 24 weight matrices in a 67M-parameter LM yields ~10,000 parameter subcomponents
Quantitative result of VPD application; the network's 24 matrices decompose into approximately 10,000 rank-one subcomponents.
Source paper
extracted_fromNeighborhood — ranked by edge-count
Claims (1)
claim
- Central claim that VPD successfully uncovers genuine mechanisms.
Communities (3)
community
- Spans attention head decomposition, benchmark awareness, and genomic pathogenicity prediction via neural models.
- Tracing information flow through weight matrices and attention heads using attribution graphs to identify causally important subcomponents in language models.
- Tracing information flow through parameter subcomponents to isolate computational mechanisms for specific model predictions, using tools like attribution graphs and VPD.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The core idea of decomposing weight matrices into components for interpretability.
- The layer approximately two-thirds through an LLM's transformer stack, reported to best predict human brain activity; identified as promising for consciousness indicators.
- A 337-character contemplative system prompt lifts all 28 models by +2.62 points on a 10-point scale.finding0.705Core empirical result: every model, every architecture, every alignment type responds to the contemplative prompt with measurable gain.
- One of the simple rank-one matrices resulting from VPD that sums with others to reconstruct the original model weights and has a specific functional role.
- What matrix decomposition or dimensionality reduction best summarizes the enormous low-rank OV and QK matrices?question0.701Open methodological question about converting the 50k x 50k expanded matrices into human-graspable summaries
- Estimation from 50,000 m³ volume and average cell volume 100 cm³.
- Specific discovered subcomponent that activates on punctuation like ' :', ' ;', ' =', ':-' and predicts the rest of emoticons/emojis.
- Interpretive claim that the subcomponents correspond to real functional units.