claim
active
claim:divergent-representations-are-a-common-if-not-likely-outcome-of-causal-interventions-across-a-wide-range-of-methods

Divergent representations are a common, if not likely, outcome of causal interventions across a wide range of methods

Core empirical claim of the paper supported by both theoretical proof and empirical demonstration

Source paper

extracted_from
Addressing divergent representations from causal interventions on neural networks
(2025) · Satchel Grant · Simon Jerome Han · Alexa R. Tartaglini · Christopher Potts

Neighborhood — ranked by edge-count

Findings (4)

finding

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.