concept
active
concept:sparsity-reconstruction-tradeoffSparsity-reconstruction tradeoff
The balance between how sparse and how faithful a decomposition is; VPD achieves a better tradeoff than transcoders.
Neighborhood — ranked by edge-count
Papers (1)
paper
Claims (1)
claim
- Quantitative advantage claimed for VPD over a prior activation-decomposition method.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Empirical result demonstrating VPD's efficiency advantage in parameter decomposition.
- Metric of how well models reconstruct information from hidden states; Sauers' study found showing janus thread extends distribution tails.
- Statistically rigorous analysis of Claude introspection; suggests models may have latent introspective capabilities that can be enhanced or disrupted.
- Opening sentence defining self-evidencing.
- Concise statement of the free-energy principle's unification of action and perception.
- Longstanding debate from probing literature about whether complex probes reveal genuine encodings or just memorise; this paper revives it for causal abstraction
- giving models janus's thread extends reconstruction accuracy distribution tails in both directionsfinding0.700Sauers' study: exposing models to janus's post extended both positive and negative extremes of reconstruction accuracy.