method
active
method:value-weighted-attention-pattern-visualizationValue-Weighted Attention Pattern Visualization
Visualizing attention patterns weighted by the norm of value vectors to better show how much information is moved from each position
Neighborhood — ranked by edge-count
Papers (1)
paper
Thinkers (1)
thinker
- Kobayashi et al.introducesRecently introduced value-weighted attention patterns; the paper's approach mirrors and extends this
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Attention patterns scaled by the norm of the value vector at each source position, showing how large a vector is moved from each position
- Method of optimizing input to cause a neuron to fire maximally, used to characterize what a neuron detects; establishes causal link
- Technique of building a fluid, three-dimensional vision by closing one's eyes, relying on words and feeling to avoid arbitrary graphical over-specification.
- Process using Q, K, V to compute a heat map over K and weighted sum of V.
- Visual reasoning tasks often interleaved with intermediate visual states; promising direction in the field.
- Used to visualize LLM true/false representations, revealing clear linear structure separating true from false statements
- Identification of algorithms implemented in attention layers, distributed across attention headsfinding0.739VPD successfully recovered interpretable attention algorithms (previous-token behavior, syntax-boundary routing) in weight space without requiring manual decomposition across heads.
- Mathematical equivalence showing the relationship between attention mechanisms and convolutional operations