claim
active
claim:zero-shot-model-stitching-without-a-learned-stitching-layer-is-feasible-because-different-text-models-embed-data-in-remarkably-similar-waysZero-shot model stitching without a learned stitching layer is feasible because different text models embed data in remarkably similar ways
Strong evidence for representational alignment across models
Source paper
extracted_from(2024) · Minyoung Huh · Brian Cheung · Tongzhou Wang · Phillip Isola
Neighborhood — ranked by edge-count
Concepts (1)
concept
- Zero-Shot Model Stitchingassociated_withModel stitching without learning a stitching layer, demonstrating strong alignment across different model training regimes
Claims (1)
claim
- Primary empirical claim of the paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Moschella et al. result cited as evidence of representational convergence across models
- Formal analysis showing the theoretical limitation of model stitching as a similarity measure.
- Demonstrated transformers on mathematical understanding and logic; cited to motivate transformer versatility.
- Articulates why a one-layer transformer with MLP is the appropriate starting target for mechanistic interpretability
- Proposed practical method for achieving step-by-step feedback in design.
- Core interpretive claim supported by the formal analysis showing MAS does not exploit the behavioral null space unlike stitching.
- Technique to measure representational compatibility by integrating intermediate representations of one model into another
- Alternative hypothesis for how experience reports arise without explicit performance