method
active
method:activation-manifold-fitting-m-hactivation manifold fitting (M_h)
Method to fit a manifold M_h to neural representations in activation space.
Neighborhood — ranked by edge-count
Papers (1)
paper
Concepts (1)
concept
- Central framework: steering neural networks by intervening along the curved manifold where a concept lives, rather than in straight lines through activation space.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Manifold fitted to representations in activation space.
- The low-dimensional geometric structure discovered in neural activation space; contrasted with linear/Euclidean geometry.
- Method to fit a manifold M_y to output probability distributions.
- Manifold fitted to output probability distributions (behavior).
- The procedure of fitting a one-dimensional manifold (path) to clusters in activation or behavior space to capture the geometric structure of a concept.
- Central empirical result showing causal coupling between representation and behavior geometry across multiple substrates and modalities.