hypothesis
active
hypothesis:we-hypothesize-that-representation-geometry-drives-model-behavior-the-geometric-structure-of-internal-representations-causally-shapes-what-models-do-externallyWe hypothesize that representation geometry drives model behavior — the geometric structure of internal representations causally shapes what models do externally.
The causal hypothesis motivating the use of causality (intervention) as the lens connecting representation and behavior geometry.
Source paper
extracted_fromNeighborhood — ranked by edge-count
Papers (1)
paper
Findings (1)
finding
- Key empirical result showing that optimizing for behavioral outputs and fitting representation geometry produce the same path in activation space.
Claims (1)
claim
- The paper's deepest interpretive claim, asserting that representation structure and behavioral structure are not coincidentally aligned but deeply connected.
Concepts (1)
concept
- The use of interventions (rather than correlations) to establish a causal link between representation geometry and behavioral geometry.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The central scientific question the paper addresses through the lens of interventional causality.
- The motivating research question of the paper
- The paper's core causal assertion: geometry is not incidental but mechanistically linked to behavior
- The paper's finding that the alignment holds in both directions — from representation to behavior and from behavior back to representation space.
- Author’s interpretive claim that the shared geometry is general and robust.
- Opening sentence framing the paper's core inquiry.
- Central empirical claim of the paper, demonstrated across tasks and modalities