concept
active
concept:curved-manifoldcurved manifold
A smoothly varying lower-dimensional surface in activation space that captures a concept better than a straight linear direction.
Neighborhood — ranked by edge-count
Claims (1)
claim
- Proposes that nonlinear geometric structure is superior to linear feature spaces for capturing semantic content.
Concepts (1)
concept
- manifoldrelated_toA smooth, potentially curved surface in activation space along which activations vary according to a coherent semantic dimension.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The type of manifold fitted to the cyclic concept structure in both activation and behavior space — a path along which steering moves the model.
- An interpretability approach that describes representations in terms of entire curved manifolds rather than many small features.
- Hypothesized extension of superposition where features may be higher-dimensional manifolds rather than 1D directions
- A single-continuous curve in activation space encoding a single variable, such as car position in the Mountain Car case.
- The actual shapes and spatial relationships of buildings, essential to living structure.
- One-dimensional curved surface in output probability space; the paper shows this mirrors representation manifold structure.
- Technique used to fit M_h and M_y from data; enables manifold steering.
- One-dimensional curved surface in internal activation space; the paper demonstrates alignment with behavior manifold.