finding
active
finding:model-stitching-achieves-nearly-perfect-iia-even-for-rank-2-transformation-matrices-on-multi-object-gru-modelsModel stitching achieves nearly perfect IIA even for rank-2 transformation matrices on Multi-Object GRU models
Evidence that model stitching can exploit the behavioral null space, making it less causally restrictive than MAS.
Neighborhood — ranked by edge-count
Claims (1)
claim
- Model stitching can use the behavioral null space of the source model when mapping to the target, making successful stitching insufficient evidence of representational similarityassociated_withsupportsFormal analysis showing the theoretical limitation of model stitching as a similarity measure.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Demonstrates MAS's ability to bidirectionally transfer behavior where RSA shows low embedding correlation.
- MAS reduces number of required alignment matrices for n-model comparison from n(n-1) or n^2 (stitching) to nfinding0.759Key computational efficiency advantage of MAS over traditional model stitching for multi-model comparisons.
- Validates MAS as a causal detector of representational differences invisible to correlative methods.
- Moschella et al. result cited as evidence of representational convergence across models
- Strong evidence for representational alignment across models
- Empirical support for vacuousness of unrestricted causal abstraction
- Core interpretive claim supported by the formal analysis showing MAS does not exploit the behavioral null space unlike stitching.
- Prior work shows transformers use anti-Markovian solutions; MAS correctly shows low IIA reflecting this, while RSA/CKA do not detect it.