question
active
What can causal abstraction analyses tell us about how DNNs encode features if the methods themselves rely on encoding assumptions?
Circular dependency problem raised in discussion
Source paper
extracted_from (2025) · Sutter, Denis · Minder, Julian · Hofmann, Thomas · Pimentel, Tiago
Neighborhood — ranked by edge-count
Papers (1)
paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; candidates for new edges or unrecognized duplicates.
- Authors' interpretation connecting their proof to practical interpretability methodology
- Load-bearing formulation of the paper's central argument
- What is the connection between information encoding assumptions and causal abstraction? (question · 0.831) Identified as exciting future work direction
- Methodological claim about the scientific value of combining causal abstraction with representational geometry analysis
- Central thesis of the paper
- Historical framing of how representation assumptions have evolved in causal interpretability
- Interpretive claim about what linear DAS results actually tell us
- Motivated by the finding that lexical entailment decomposes into word identities.