concept
active
concept:behavior-manifold-m-y
behavior manifold M_y
Manifold fitted to output probability distributions (behavior).
Neighborhood — ranked by edge-count
Concepts (4)
- behavior manifold · related_to, same_as · One-dimensional curve in output probability space; the paper shows that it mirrors the structure of the representation manifold.
- activation manifold M_h · associated_with · Manifold fitted to representations in activation space.
- neural behavior geometry · associated_with · The manifold structure of model outputs, modelled by M_y.
- Behavioral Trajectory · associated_with · The path traced through output probability distribution space as interventions are applied to activations.
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; candidates for new edges or unrecognized duplicates.
- Method to fit a manifold M_y to output probability distributions.
- Method that optimizes activation interventions so that resulting behaviors trace M_y, recovering activation paths that follow M_h curvature.
- The procedure of fitting a one-dimensional manifold (path) to clusters in activation or behavior space to capture the geometric structure of a concept.
- Central empirical result showing causal coupling between representation and behavior geometry across multiple substrates and modalities.
- Generalization finding from the full paper extending beyond days-of-week to other structured concepts.
- A geometric space of all output token probability distributions, equipped with Hellinger distance, used to visualize model behavior.
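The Hellinger distance mentioned in the last entry can be illustrated with a minimal sketch. It assumes the standard convention H(p, q) = (1/√2)·‖√p − √q‖₂, which keeps distances in [0, 1]; the entry itself does not say which normalization the paper uses. The function names `hellinger` and `trajectory_length` are hypothetical and chosen only for illustration.

```python
import math


def hellinger(p, q):
    """Hellinger distance between two discrete distributions.

    Assumes p and q are probability vectors over the same token
    vocabulary, and uses the 1/sqrt(2) normalization (an assumption).
    """
    if len(p) != len(q):
        raise ValueError("distributions must have the same length")
    squared = sum((math.sqrt(a) - math.sqrt(b)) ** 2 for a, b in zip(p, q))
    return math.sqrt(squared / 2.0)


def trajectory_length(dists):
    """Sum of Hellinger steps along an ordered sequence of distributions.

    Illustrative only: one way to measure the length of a behavioral
    trajectory through output probability space.
    """
    return sum(hellinger(a, b) for a, b in zip(dists, dists[1:]))
```

Under this convention, identical distributions are at distance 0 and distributions with disjoint support are at distance 1. A trajectory that passes through an intermediate distribution is never shorter than the direct distance between its endpoints.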