finding
active
finding:steering-llama-3-1-8b-along-the-circular-representation-manifold-produces-outputs-that-follow-the-natural-circle-of-the-behavior-manifold-cleanly-shifting-probability-mass-from-monday-through-successive-daysSteering Llama-3.1 8B along the circular representation manifold produces outputs that follow the natural circle of the behavior manifold, cleanly shifting probability mass from Monday through successive days.
Core empirical result demonstrating that manifold steering produces on-target, behavior-aligned outputs.
Source paper
extracted_fromNeighborhood — ranked by edge-count
Papers (1)
paper
Claims (1)
claim
- Steering along manifolds provides better control than linear steering when the concept geometry is non-linear.associated_withsupportsThe central thesis of the paper, motivating the shift from linear to geometry-aware manifold steering.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Empirical result demonstrating the failure mode of linear steering when concept geometry is cyclic.
- Empirical observation establishing that Llama's behavior for days-of-week tasks has circular structure.
- Empirical demonstration on Llama-3.1-8B that steering along representation manifold aligns outputs with behavior manifold, whereas linear steering does not.
- Core empirical claim comparing steering approaches on cyclic concepts.
- Empirical observation establishing that Llama's internal representations for days-of-week have circular geometric structure.
- Illustrative finding that ESR mitigates but does not fully eliminate steering influence
- The complete mechanistic algorithm discovered for cyclic concept reasoning