finding
active
finding:das-achieves-substantial-causal-effect-even-on-arbitrary-input-output-mappings-where-no-causal-mechanism-should-existDAS achieves substantial causal effect even on arbitrary input-output mappings where no causal mechanism should exist
Replication of Wu et al. 2023 finding; DAS expressivity concern validated in CausalGym setup
Source paper
extracted_from(2024) · Aryaman Arora · Dan Jurafsky · Christopher Potts
Neighborhood — ranked by edge-count
Papers (1)
paper
Claims (1)
claim
- Author interpretation of selectivity results showing DAS advantage diminishes when controlling for expressivity
Findings (1)
finding
- Corroborates Wu et al. 2023 finding that DAS expressivity inflates causal effect estimates
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Core methodological question motivating the introduction of selectivity and control tasks
- Central claim motivating DAS over prior methods.
- Central thesis of the paper
- Methodological claim about the scientific value of combining causal abstraction with representational geometry analysis
- Authors' interpretation connecting their proof to practical interpretability methodology
- Authors connect their finding to the prior probing literature debate
- Causal emergence measured by NIS+ increases with observational noise but decreases with dynamical noise.finding0.777Insight that coarse-graining filters external noise but not intrinsic noise.
- Historical framing of how representation assumptions have evolved in causal interpretability