method
active
method:manifold-fitting-to-representation-behavior-spaceManifold Fitting to Representation/Behavior Space
The procedure of fitting a one-dimensional manifold (path) to clusters in activation or behavior space to capture the geometric structure of a concept.
Neighborhood — ranked by edge-count
Concepts (1)
concept
- Central framework: steering neural networks by intervening along the curved manifold where a concept lives, rather than in straight lines through activation space.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Method to fit a manifold M_y to output probability distributions.
- One-dimensional curved surface in output probability space; the paper shows this mirrors representation manifold structure.
- Generalization finding from the full paper extending beyond days-of-week to other structured concepts.
- One-dimensional curved surface in internal activation space; the paper demonstrates alignment with behavior manifold.
- Manifold fitted to output probability distributions (behavior).
- The paper's causal explanation for why representation and behavior geometry both appear circular for days of the week.
- The paper's finding that the alignment holds in both directions — from representation to behavior and from behavior back to representation space.