claim
active
claim:the-difference-in-means-direction-is-the-unique-nullity-1-projection-kernel-that-eliminates-all-linearly-recoverable-binary-classification-information-from-a-dataset

The difference-in-means direction is the unique nullity-1 projection kernel that eliminates all linearly-recoverable binary classification information from a dataset

Formal consequence of Belrose et al. (2023) Theorem G.1 connecting mass-mean probing to optimal linear concept erasure

Neighborhood — ranked by edge-count

Frameworks (1)

framework
  • Introduced in this paper: an optimization-free probing technique using difference-in-means direction with optional covariance correction

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.