finding
active
finding:cka-and-rsa-show-potentially-unintuitive-over-estimated-hidden-state-similarity-for-gru-transformer-comparisons-on-multi-object-taskCKA and RSA show potentially unintuitive (over-estimated) hidden state similarity for GRU-Transformer comparisons on Multi-Object task
Prior work shows transformers use anti-Markovian solutions; MAS correctly shows low IIA reflecting this, while RSA/CKA do not detect it.
Neighborhood — ranked by edge-count
Claims (1)
claim
- Correlative methods like RSA and CKA are insufficient for determining functional similarity between neural systems; causal methods are necessaryassociated_withsupportsCentral motivating claim of the paper; supported by empirical comparisons showing RSA/CKA miss Markovian differences detectable by MAS.
Frameworks (1)
framework
- Representational Similarity Analysis (RSA)associated_withA correlational similarity method compared against MAS; uses RDM correlations between model representations.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Validates MAS as a causal detector of representational differences invisible to correlative methods.
- Demonstrates RSA's sensitivity issue in embedding layers; attributed partly to Spearman rank handling of RDMs with differing relative extrema.
- Demonstrates MAS's ability to bidirectionally transfer behavior where RSA shows low embedding correlation.
- Explains why mutual k-NN was chosen over CKA as primary metric
- Claim formalizing the Anima Labs idea that transformers are effectively recurrent due to K/V stream.
- Tested in Section 4.4 calibration experiment; confirmed by findings.
- Validates robustness of alignment metric choice
- Interpretive claim connecting exponential path combinatorics to Lindsey's layer-dependent findings.