claim
active
claim:manifold-level-descriptions-recover-overarching-semantic-structure-that-sae-features-miss

Manifold-level descriptions recover overarching semantic structure that SAE features miss.

Positive claim that geometric descriptions retain the conceptual coherence lost in atomized feature decompositions.

Source paper

extracted_from
The World Inside Neural Networks
(2026) · Geiger, Atticus · Lubana, Ekdeep Singh · Fel, Thomas · Merullo, Jack +3

Neighborhood — ranked by edge-count

Concepts (3)

concept
  • The individual, supposedly monosemantic directions learned by SAEs; argued here to fragment manifolds into disconnected pieces.
  • The meaningful organization of concepts in a model's representation space, claimed to be better captured by manifolds than by SAEs.
  • An interpretability approach that describes representations in terms of entire curved manifolds rather than many small features.

Claims (1)

claim

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.