claim
active
claim:there-is-a-clear-bidirectional-relationship-between-the-geometry-of-behavior-and-representation-steering-along-representation-manifolds-follows-behavior-manifolds-and-vice-versaThere is a clear bidirectional relationship between the geometry of behavior and representation: steering along representation manifolds follows behavior manifolds, and vice versa.
The paper's finding that the alignment holds in both directions — from representation to behavior and from behavior back to representation space.
Source paper
extracted_fromNeighborhood — ranked by edge-count
Papers (1)
paper
Findings (1)
finding
- Key empirical result showing that optimizing for behavioral outputs and fitting representation geometry produce the same path in activation space.
Claims (2)
claim
- Author’s interpretive claim that the shared geometry is general and robust.
- The paper's deepest interpretive claim, asserting that representation structure and behavioral structure are not coincidentally aligned but deeply connected.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Core finding: the structure models use internally (representations) is precisely reflected in their external behavior (outputs).
- The finding that steering along M_h yields M_y behavior, and optimizing for M_y paths recovers M_h trajectories.
- Central empirical claim of the paper, demonstrated across tasks and modalities
- The paper's generalization claim, asserting that the days-of-week finding scales to other cyclic and structured concepts.
- Extension of manifold steering validation to video world models and physical dynamics tasks, demonstrating cross-modal generality
- The causal hypothesis motivating the use of causality (intervention) as the lens connecting representation and behavior geometry.
- The paper's causal explanation for why representation and behavior geometry both appear circular for days of the week.
Restated by (1)
cosine ≥ 0.90Other entities that say roughly the same thing. May be merge candidates or independent restatements across papers.