concept
active
concept:bottom-up-interpretability

Bottom-up interpretability

An interpretability paradigm that explains computation in the model's own terms, rather than imposing top-down abstractions; VPD aims to realize this.

Neighborhood — ranked by edge-count

Claims (1)

claim

Methods (1)

method

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.