claim
active
claim:steering-along-manifolds-provides-better-control-than-linear-steering-when-the-concept-geometry-is-non-linearSteering along manifolds provides better control than linear steering when the concept geometry is non-linear.
The central thesis of the paper, motivating the shift from linear to geometry-aware manifold steering.
Source paper
extracted_fromNeighborhood — ranked by edge-count
Papers (1)
paper
Findings (2)
finding
- Linear steering on Llama-3.1 8B for the days-of-week task cuts across the behavior manifold, producing noisy off-target effects where predicted tokens are not even days of the week.associated_withsupportsEmpirical result demonstrating the failure mode of linear steering when concept geometry is cyclic.
- Core empirical result demonstrating that manifold steering produces on-target, behavior-aligned outputs.
Questions (1)
question
- The motivating question that opens the paper and leads to the development of manifold steering.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Core empirical claim comparing steering approaches on cyclic concepts.
- Empirical demonstration on Llama-3.1-8B that steering along representation manifold aligns outputs with behavior manifold, whereas linear steering does not.
- Proposes that nonlinear geometric structure is superior to linear feature spaces for capturing semantic content.
- Attribution of failure to Euclidean assumption.
- The research gap that motivates manifold steering as an alternative to conventional linear approaches
- The paper's critique of the standard linear steering baseline, supported by the days-of-week demo.