concept
active
concept:principled-control-via-internals-using-geometryprincipled control via internals using geometry
Claim that geometry enables accurate intervention; steering moves from direction-finding to geometry-finding.
Neighborhood — ranked by edge-count
Claims (1)
claim
- Core interpretive assertion: geometric structure is causally load-bearing, not epiphenomenal.
Concepts (1)
concept
- The goal of mechanistically-grounded, reliable control of neural network behavior via activation interventions
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The actual shapes and spatial relationships of buildings, essential to living structure.
- What is the right geometry for enabling principled steering of neural network behavior?question0.745The reframed steering problem the paper introduces
- Strong interpretive assertion linking discovery and control: neural computation is fundamentally manifold-structured.
- General technique of modifying activations to control model behavior.
- Models can modulate their internal representations when instructed or incentivized to 'think about' a concept; effect replicates across all tested models regardless of capability.
- Strong statement that all qualitative aspects of places and situations are produced by the spatial system of centers.