claim
active
claim:causal-abstraction-is-not-enough-for-mechanistic-interpretability-because-it-becomes-vacuous-without-assumptions-about-how-models-encode-informationCausal abstraction is not enough for mechanistic interpretability because it becomes vacuous without assumptions about how models encode information
Central thesis of the paper
Source paper
extracted_from(2025) · Sutter, Denis · Minder, Julian · Hofmann, Thomas · Pimentel, Tiago
Neighborhood — ranked by edge-count
Papers (1)
paper
Findings (4)
finding
- Theorem 1: Any algorithm is an input-restricted distributed abstraction of any DNN satisfying mild assumptionsassociated_withsupportsCentral theoretical result proving unrestricted causal abstraction is trivial
- Training progression result showing non-linear maps are uncorrelated with genuine task learning
- Key empirical result: non-linear maps overcome linear maps' failure in deeper layers
- Corroborating result on additional task confirming main paper findings
Quotes (1)
quote
- Load-bearing formulation of the paper's central argument
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Methodological claim about the scientific value of combining causal abstraction with representational geometry analysis
- Authors' interpretation connecting their proof to practical interpretability methodology
- Historical framing of how representation assumptions have evolved in causal interpretability
- What is the connection between information encoding assumptions and causal abstraction?question0.816Identified as exciting future work direction
- Replication of Wu et al. 2023 finding; DAS expressivity concern validated in CausalGym setup
- Circular dependency problem raised in discussion
- The paper endorses Geiger et al. 2023's claim that disparate interpretability methods are instances of causal abstraction.
- Authors connect their finding to the prior probing literature debate