concept
active
concept:sparse-interpretability-in-neural-networks

Sparse interpretability in neural networks

VPD achieves sparse, interpretable parameter subcomponents with improved sparsity-reconstruction tradeoff.

Neighborhood — ranked by edge-count

Concepts (1)

concept
  • The field aimed at understanding what neural networks have learned; characterized as pre-paradigmatic in this paper

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.