question
active
question:what-nuances-do-we-miss-when-we-fail-to-causally-probe-the-representations-of-the-systems
What nuances do we miss when we fail to causally probe the representations of the systems?
Motivates the empirical comparison between MAS and RSA/CKA in the paper.
Neighborhood — ranked by edge-count
Papers (1)
paper
- Model Alignment Search (associated_with)
Claims (1)
claim
- Central motivating claim of the paper; supported by empirical comparisons showing RSA/CKA miss Markovian differences detectable by MAS.
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; candidates for new edges or unrecognized duplicates.
- Question raised by the discrepancy between DAS IIA and linear probe accuracy in Case Study II
- Key interpretive claim from Case Study II distinguishing probe accuracy from causal relevance
- Motivated by the finding that lexical entailment decomposes into word identities.
- Supported by the finding that non-trivial rotations are required to find aligned representations.
- Load-bearing epistemic caution the author places on the entire analytical framework.
- The central scientific question the paper addresses through the lens of interventional causality.
- Key methodological claim: MM probes are both competitive in accuracy and superior in causal influence
- Opening sentence defining self-evidencing.