concept
active
concept:behavior-spaceBehavior Space
A geometric space of all output token probability distributions, equipped with Hellinger distance, used to visualize model behavior.
Neighborhood — ranked by edge-count
Methods (1)
method
- Metric used to define geometric space of output token probability distributions in behavior manifold analysis.
Concepts (2)
concept
- Behavioral Null Spacerelated_toThe span of vector directions that do not change network behavior; a key concept distinguishing MAS from model stitching.
- behavior manifoldassociated_withOne-dimensional curved surface in output probability space; the paper shows this mirrors representation manifold structure.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The traditional space of movement in the physical world where animals exhibit problem-solving behavior.
- A vector subspace that causally impacts outputs only through the sign of its values, enabling harmless magnitude divergence
- The ensemble of all possible configurations of a building, including incomplete states and paths between them.
- Representation space on which linear probes operate to attribute harmful behaviors to training data.
- The procedure of fitting a one-dimensional manifold (path) to clusters in activation or behavior space to capture the geometric structure of a concept.
- The path in activation space derived by optimizing steering interventions to produce outputs along the behavior manifold, independent of representation geometry.