question
active
question:how-much-of-the-causal-effect-found-by-das-is-due-to-its-expressivity-rather-than-any-aspect-of-the-representation-being-studied

How much of the causal effect found by DAS is due to its expressivity rather than any aspect of the representation being studied?

Core methodological question motivating the introduction of selectivity and control tasks

Source paper

extracted_from
CausalGym: Benchmarking causal interpretability methods on linguistic tasks
(2024) · Aryaman Arora · Dan Jurafsky · Christopher Potts

Neighborhood — ranked by edge-count

Findings (1)

finding

Claims (1)

claim

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.