hypothesis
active
hypothesis:larger-hidden-representations-create-more-random-structure-that-das-can-search-through-allowing-manipulation-of-counterfactual-behavior-even-in-randomly-initialized-networks

Larger hidden representations create more random structure that DAS can search through, allowing manipulation of counterfactual behavior even in randomly initialized networks

Tested in Section 4.4 calibration experiment; confirmed by findings.

Source paper

extracted_from
Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations
(2023) · Atticus Geiger · Zhengxuan Wu · Christopher Potts · Thomas Icard +1

Neighborhood — ranked by edge-count

Findings (2)

finding

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.