finding
active
finding:mas-successfully-aligns-behavior-between-multi-object-gru-models-in-both-embedding-and-hidden-state-layers-with-high-iiaMAS successfully aligns behavior between Multi-Object GRU models in both embedding and hidden state layers with high IIA
Demonstrates MAS's ability to bidirectionally transfer behavior where RSA shows low embedding correlation.
Neighborhood — ranked by edge-count
Claims (1)
claim
- Central motivating claim of the paper; supported by empirical comparisons showing RSA/CKA miss Markovian differences detectable by MAS.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Validates MAS as a causal detector of representational differences invisible to correlative methods.
- Shows MAS can compare specific numeric variables across tasks with different domains/codomains.
- Case study showing MAS can compare specific causal information types across models trained on different tasks.
- MAS reduces number of required alignment matrices for n-model comparison from n(n-1) or n^2 (stitching) to nfinding0.799Key computational efficiency advantage of MAS over traditional model stitching for multi-model comparisons.
- Prior work shows transformers use anti-Markovian solutions; MAS correctly shows low IIA reflecting this, while RSA/CKA do not detect it.
- GRU behavior can be compressed to as few as 4 dimensions using DAS and MAS with comparable IIAsfinding0.792Shows that behaviorally relevant information is low-dimensional; contrasted with model stitching achieving near-perfect IIA at rank 2.
- Demonstrates RSA's sensitivity issue in embedding layers; attributed partly to Spearman rank handling of RDMs with differing relative extrema.
- Evidence that model stitching can exploit the behavioral null space, making it less causally restrictive than MAS.