concept
active
concept:path-expansion-trickPath Expansion Trick
The mathematical trick of expanding a product of layer terms into a sum of end-to-end path terms, enabling independent analysis of each term
Neighborhood — ranked by edge-count
Thinkers (1)
thinker
- Dong et al.studiesPrior work that considered paths through a self-attention network in analyzing transformer expressivity, deriving the same path expansion structure
Frameworks (1)
framework
- Prior Anthropic paper enabling circuit-level analysis of attention-only transformers; motivates current MLP decomposition
Claims (1)
claim
- Architectural observation enabling the entire mathematical framework; the residual stream is purely a sum of linear projections
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The core analytical technique of expanding transformer computations from layer-by-layer products into sums of end-to-end path terms for independent analysis
- Method by Goldowsky-Dill et al. 2023 for localizing model behavior via targeted activation interventions
- Neural mechanism for tracking location through accumulation of self-movement vectors; shown to play the role of position encodings in TEM.
- Specific implementation question about land acquisition for the pedestrian hull.
- The number of distinct paths information can travel from point A to B in a transformer is C(m+n, n), quickly exceeding the number of atoms in the universe.
- The path in activation space derived by fitting the representation manifold, used to steer along the geometric structure of internal representations.
- Quantifies extreme redundancy in transformer routing; supports claim that introspection and interference patterns are architecturally permitted.
- Feynman's quantum method where global behavior of light emerges from local behaviors with assigned probabilities; cited as example of global emerging from local