finding
active
finding:coordinate-patching-on-circular-manifolds-guarantees-off-manifold-representations-for-boundary-point-pairs-with-orthogonal-deviationsCoordinate patching on circular manifolds guarantees off-manifold representations for boundary point pairs with orthogonal deviations
Theoretical proof that patching produces divergent representations for most manifold geometries
Source paper
extracted_from(2025) · Satchel Grant · Simon Jerome Han · Alexa R. Tartaglini · Christopher Potts
Neighborhood — ranked by edge-count
Claims (1)
claim
- Core empirical claim of the paper supported by both theoretical proof and empirical demonstration
Findings (1)
finding
- A nonempty convex set in R^d is patch-closed if and only if it is an axis-aligned hyperrectanglesupportsMathematical characterization showing only hyperrectangles avoid divergence under coordinate patching
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Core empirical result demonstrating that manifold steering produces on-target, behavior-aligned outputs.
- Empirical demonstration on Llama-3.1-8B that steering along representation manifold aligns outputs with behavior manifold, whereas linear steering does not.
- Core claim about why pernicious divergence undermines mechanistic conclusions
- Cross-modality result from the full paper demonstrating that representation-behavior geometry alignment is not limited to language models.
- The paper's finding that the alignment holds in both directions — from representation to behavior and from behavior back to representation space.
- The paper's causal explanation for why representation and behavior geometry both appear circular for days of the week.
- Formal analysis showing the theoretical limitation of model stitching as a similarity measure.