question
active
question:what-is-the-connection-between-information-encoding-assumptions-and-causal-abstractionWhat is the connection between information encoding assumptions and causal abstraction?
Identified as exciting future work direction
Source paper
extracted_from(2025) · Sutter, Denis · Minder, Julian · Hofmann, Thomas · Pimentel, Tiago
Neighborhood — ranked by edge-count
Papers (1)
paper
Concepts (1)
concept
- Core contribution: the impasse where lifting linearity in alignment maps makes causal abstraction vacuous, but keeping it may miss non-linearly encoded features
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Authors' interpretation connecting their proof to practical interpretability methodology
- Circular dependency problem raised in discussion
- Central thesis of the paper
- Load-bearing formulation of the paper's central argument
- Graded notion of causal abstraction measured by IIA; when IIA is alpha < 100%, the model is alpha-on-average approximately abstract.
- Methodological claim about the scientific value of combining causal abstraction with representational geometry analysis
- Historical framing of how representation assumptions have evolved in causal interpretability
- A framework the paper uses alongside feature geometry to deepen mechanistic understanding of LMs