VPD achieves a better sparsity-reconstruction tradeoff than transcoders.

Quantitative advantage claimed for VPD over a prior activation-decomposition method.

Source paper

extracted_from

Interpreting Language Model Parameters

(2026) · Bushnaq, Lucius · Braun, Dan · Clive-Griffin, Oliver · Bussmann, Bart +4

Neighborhood — ranked by edge-count

Concepts (1)

concept

Sparsity-reconstruction tradeoff
cites
The balance between how sparse and how faithful a decomposition is; VPD achieves a better tradeoff than transcoders.

Methods (1)

method

Transcoders
cites
Decomposition method for activations; VPD is compared against transcoders in sparsity-reconstruction tradeoff.

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

VPD achieves better sparsity-reconstruction tradeoff than transcoders on 67M modelfinding0.946
Empirical result demonstrating VPD's efficiency advantage in parameter decomposition.
VPD subcomponents are sparse, interpretable, and avoid feature splitting.claim0.790
Assertion about the qualitative advantages of VPD's rank-one decomposition.
VPD is a meaningful step toward bottom-up interpretabilityclaim0.758
Positioning of VPD as advancing the paradigm of explaining computation in the model's terms.
VPD decomposes parameters, not activations, flipping the standard SAE / activation-patching paradigm.claim0.756
Core proposition of the paper: a substrate-level critique of existing interpretability methods.
The VPD-based edit has similarly low off-target effects as uninterpretable fine-tuning methodsfinding0.755
Performance comparison showing subcomponent editing is comparable to fine-tuning in preserving off-target behavior.
VPD enables manual model editing through direct parameter manipulation.claim0.746
Applied capability claim: VPD enables surgical changes to model behaviour at the parameter level.
probably helps not only with faithful reconstruction but also creates interference patterns that encode nuanced information about the deltas and convergences between states.quote0.746
Key quote connecting path redundancy to interferometric information encoding.
The ability to make precise edits demonstrates that VPD identifies real computational machineryclaim0.745
Claim that editing success validates VPD's decomposition.