hypothesis
active
hypothesis:we-hypothesize-that-interventions-that-respect-the-geometry-of-activation-space-will-yield-behaviors-close-to-those-the-model-exhibits-naturally

We hypothesize that interventions that respect the geometry of activation space will yield behaviors close to those the model exhibits naturally

The core testable hypothesis driving the experimental design

Source paper

extracted_from
Manifold Steering Reveals the Shared Geometry of Neural Network Representation and Behavior
(2026) · Daniel Wurgaft · Can Rager · Matthew Kowal · Vasudev Shyam +12

Neighborhood — ranked by edge-count

Findings (1)

finding

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.