question
active
question:what-should-you-do-if-you-want-to-perform-a-causal-analysis-of-your-dnn
What should you do if you want to perform a causal analysis of your DNN?
Practical question the paper attempts to answer in its conclusion
Source paper
extracted_from · (2025) · Sutter, Denis · Minder, Julian · Hofmann, Thomas · Pimentel, Tiago
Neighborhood — ranked by edge-count
Concepts (1)
concept
- Non-Linear Representation Dilemma · associated_with · Core contribution: the impasse where lifting the linearity constraint on alignment maps makes causal abstraction vacuous, while keeping it may miss non-linearly encoded features
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Circular dependency problem raised in discussion
- Motivates the bidirectional design of MAS over unidirectional model stitching.
- Authors' interpretation connecting their proof to practical interpretability methodology
- Load-bearing formulation of the paper's central argument
- The formal method used to establish that the identified circuit causally mediates the model's cyclic reasoning behavior
- Motivated by the finding that lexical entailment decomposes into word identities.
- Mechanistic interpretability technique for locating factual associations, mentioned as future work direction.
- Vision statement in the conclusion.