Geometry-Aware Steering Framework

The overarching theoretical framework proposed in the paper. It asserts that steering interventions should be aligned with the geometric structure of the model's representation manifold.

Neighborhood — ranked by edge-count

Papers (1)

paper

Steering Along Manifolds to Control Neural Networks
introduces

Concepts (3)

concept

Manifold Steering
implements
Central framework: steering neural networks by intervening along the curved manifold where a concept lives, rather than in straight lines through activation space.
behavior manifold
uses
One-dimensional curve in output probability space; the paper shows that it mirrors the structure of the representation manifold.
representation manifold
uses
One-dimensional curve in internal activation space; the paper demonstrates its alignment with the behavior manifold.
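The contrast between steering along a manifold and steering in a straight line can be illustrated with a toy example. The sketch below is not the paper's method: it assumes a hypothetical representation manifold that is simply the unit circle in a 2-D activation space, and all function names are illustrative. It compares a linear interpolation between two on-manifold points with a move along the arc, and measures how far each intermediate point drifts from the circle.

```python
import numpy as np

# Toy assumption: the "representation manifold" is the unit circle in 2-D.
def on_circle(theta):
    """Return the point on the unit circle at angle theta."""
    return np.array([np.cos(theta), np.sin(theta)])

def linear_steer(h_start, h_target, alpha):
    """Straight-line (direction-based) steering in activation space."""
    return h_start + alpha * (h_target - h_start)

def manifold_steer(theta_start, theta_target, alpha):
    """Steering along the manifold: interpolate the intrinsic coordinate."""
    return on_circle(theta_start + alpha * (theta_target - theta_start))

def off_manifold_distance(h):
    """Distance from h to the unit circle (0 means h is on the manifold)."""
    return abs(np.linalg.norm(h) - 1.0)

theta_start, theta_target = 0.0, np.pi / 2
alphas = np.linspace(0.0, 1.0, 5)  # steering strengths from start to target

linear_gap = [
    off_manifold_distance(
        linear_steer(on_circle(theta_start), on_circle(theta_target), a)
    )
    for a in alphas
]
manifold_gap = [
    off_manifold_distance(manifold_steer(theta_start, theta_target, a))
    for a in alphas
]

print("linear  :", np.round(linear_gap, 4))
print("manifold:", np.round(manifold_gap, 4))
```

In this toy setting the linear path leaves the circle at every intermediate strength and is farthest from it at the midpoint, while the arc-following path stays on the circle throughout. Real activation manifolds are not known in closed form, so the intrinsic coordinate used here would have to be estimated from data.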

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

geometry-based steering · concept · 0.844
Paradigm of finding the right geometry (manifold) for principled control.
Neural Geometry Framework · framework · 0.777
Conceptual scheme introduced in this paper: neural networks develop internal geometric representations that mirror real-world geometry, providing the right level of description for interpretability and control.
Psychological Steering Framework · framework · 0.761
The paper's primary contribution: performs unbounded, fluency-constrained sweeps in semantically calibrated centroid units using psychological artifacts.
Euclidean geometry assumption in steering · concept · 0.748
Linear steering implicitly assumes a flat, Euclidean activation space, leading to off-manifold excursions.
Sparse Autoencoder-based Framework for Steering Semantic Features · framework · 0.746
The main framework proposed for retrieving and steering high-order semantic features in LLMs via sparse autoencoders.
direction-based steering · concept · 0.738
Paradigm of finding the right direction in activation space (e.g., linear steering).
Manifold-aware steering is non-trivial IP requiring geometric analysis, not a system-prompt implementation · claim · 0.731
Pullback Geometry (Behavior-Aware Metric) · concept · 0.720