framework
active
framework:geometry-aware-steering-framework

Geometry-Aware Steering Framework

The overarching theoretical framework proposed in the paper, asserting that steering interventions should be aligned with the geometric structure of the model's representation manifold.

Neighborhood — ranked by edge-count

Concepts (3)

concept
  • Central framework: steering neural networks by intervening along the curved manifold where a concept lives, rather than in straight lines through activation space.
  • One-dimensional curved surface in output probability space; the paper shows this mirrors representation manifold structure.
  • One-dimensional curved surface in internal activation space; the paper demonstrates alignment with behavior manifold.

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.