claim
active
claim:off-manifold-divergences-can-activate-hidden-pathways-that-produce-misleadingly-confirmatory-behavior-while-the-true-mechanism-is-never-exercised

Off-manifold divergences can activate hidden pathways that produce misleadingly confirmatory behavior while the true mechanism is never exercised

Core claim about why pernicious divergence undermines mechanistic conclusions

Source paper

extracted_from
Addressing divergent representations from causal interventions on neural networks
(2025) · Satchel Grant · Simon Jerome Han · Alexa R. Tartaglini · Christopher Potts

Neighborhood — ranked by edge-count

Findings (2)

finding

Questions (1)

question

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.