concept
active
concept:behavioral-trajectory
Behavioral Trajectory
The path traced through the space of output probability distributions as interventions are applied to activations.
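As a rough illustration of the definition above, the sketch below sweeps an intervention strength and records the resulting output distribution at each step. Everything in it is an assumption made for illustration: the linear readout `W`, the activation `h`, the steering direction `v`, the additive form of the intervention, and the use of Hellinger distance to measure path length are all hypothetical choices, not something this entry specifies.

```python
import numpy as np

def softmax(z):
    """Numerically stable softmax over a 1-D logit vector."""
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def behavioral_trajectory(h, v, W, alphas):
    """Return one output distribution per intervention strength.

    Assumption: the intervention is additive steering, h + alpha * v,
    and the model's output is a linear readout W followed by softmax.
    """
    return np.stack([softmax(W @ (h + a * v)) for a in alphas])

def hellinger_path_length(traj):
    """Sum of Hellinger distances between consecutive distributions."""
    steps = np.sqrt(traj[1:]) - np.sqrt(traj[:-1])
    return float(np.sum(np.linalg.norm(steps, axis=1) / np.sqrt(2)))

# Toy data (hypothetical): 8-dim activation, 5 possible outputs.
rng = np.random.default_rng(0)
W = rng.normal(size=(5, 8))
h = rng.normal(size=8)
v = rng.normal(size=8)
alphas = np.linspace(0.0, 2.0, 9)

traj = behavioral_trajectory(h, v, W, alphas)   # shape (9, 5)
length = hellinger_path_length(traj)
```

Each row of `traj` is one point on the trajectory; `length` summarizes how far the output distribution moved over the sweep under the chosen metric.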
Neighborhood — ranked by edge-count
Concepts (1)
- behavior manifold M_y (associated_with): Manifold fitted to output probability distributions (behavior).
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; candidates for new edges or unrecognized duplicates.
- The path in activation space derived by optimizing steering interventions to produce outputs along the behavior manifold, independent of representation geometry.
- Observable behavioral pattern used to infer cognition; shared by plants and animals and proposed as evidence for sentience.
- Strategic filtering procedure that removes invalid trajectories and maintains an optimal positive-to-negative trajectory ratio to stabilize training.
- The traditional space of movement in the physical world where animals exhibit problem-solving behavior.
- Grouping similar model behaviors; the unsupervised method surfaces clusters of concerning patterns.
- The preservation of unrelated model capabilities after a targeted intervention, operationalized via KL divergence on Alpaca.
- The behavior a model would exhibit during real-world deployment, as opposed to evaluation behavior; the target of steering.
- An organism's belief-guided action selection, which instantiates a generative model and maintains phenotypic states.