hypothesis
active
hypothesis:manifold-geometry-provides-a-practical-blueprint-for-steering-model-behavior-across-diverse-tasks-and-modalities
Manifold geometry provides a practical blueprint for steering model behavior across diverse tasks and modalities.
A generalizing, predictive claim: manifold steering is a broadly applicable framework, not one limited to the days-of-week case study.
Source paper
extracted_from
Neighborhood, ranked by edge-count
Papers (1)
paper
Findings (2)
finding
- Analogous alignment between representation manifold and behavior manifold is found across months, letters, ages, and synthetic in-context learning tasks in language models. · associated_with · supports · Generalization finding from the full paper extending beyond days-of-week to other structured concepts.
- Cross-modality result from the full paper demonstrating that representation-behavior geometry alignment is not limited to language models.
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; candidates for new edges or unrecognized duplicates.
- Extension of manifold steering validation to video world models and physical dynamics tasks, demonstrating cross-modal generality.
- The paper's finding that the alignment holds in both directions: from representation to behavior, and from behavior back to representation space.
- Evidence that the weekday cyclic structure is not anomalous but reflects broader principle of concept geometry.
- The central thesis of the paper, motivating the shift from linear to geometry-aware manifold steering.
- Proposes that nonlinear geometric structure is superior to linear feature spaces for capturing semantic content.
- Empirical demonstration on Llama-3.1-8B that steering along the representation manifold aligns outputs with the behavior manifold, whereas linear steering does not.
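
The contrast between linear and manifold steering in the entries above can be pictured with a toy geometric sketch. It is not the paper's method and uses no model activations. It assumes, purely for illustration, that the seven weekday concepts sit evenly spaced on a unit circle in a 2-D plane. The names `day_point`, `linear_steer`, `manifold_steer`, and `radius` are hypothetical helpers introduced here.

```python
import math

# Illustrative assumption: weekday concepts are evenly spaced on a unit circle.
DAYS = ["Mon", "Tue", "Wed", "Thu", "Fri", "Sat", "Sun"]


def day_point(i):
    """Return the assumed 2-D position of weekday i on the unit circle."""
    angle = 2 * math.pi * i / len(DAYS)
    return (math.cos(angle), math.sin(angle))


def linear_steer(src, dst, t):
    """Move a fraction t along the straight chord from src to dst."""
    (x0, y0), (x1, y1) = day_point(src), day_point(dst)
    return (x0 + t * (x1 - x0), y0 + t * (y1 - y0))


def manifold_steer(src, dst, t):
    """Move a fraction t along the circle from src to dst, via the shorter arc."""
    n = len(DAYS)
    start = 2 * math.pi * src / n
    delta = 2 * math.pi * ((dst - src) % n) / n
    if delta > math.pi:  # going the other way round is shorter
        delta -= 2 * math.pi
    angle = start + t * delta
    return (math.cos(angle), math.sin(angle))


def radius(point):
    """Distance of a point from the circle's centre."""
    return math.hypot(*point)


# Steer halfway from Monday (index 0) to Thursday (index 3).
mid_linear = linear_steer(0, 3, 0.5)
mid_manifold = manifold_steer(0, 3, 0.5)

print("linear midpoint radius:  ", radius(mid_linear))
print("manifold midpoint radius:", radius(mid_manifold))
```

In this toy setting the chord midpoint falls well inside the circle, away from every weekday position, while the arc midpoint stays on the circle between two neighbouring days. Whether real model representations behave this way is the empirical question the findings above address.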