finding
active
finding:das-on-oversized-randomly-initialized-network-n-4096-for-16-dim-input-achieves-0-64-iia-by-searching-random-structureDAS on oversized randomly initialized network (|N|=4096 for 16-dim input) achieves 0.64 IIA by searching random structure
Shows that overly large hidden dimensions allow DAS to find random causal structures; calibration check.
Source paper
extracted_from(2023) · Atticus Geiger · Zhengxuan Wu · Christopher Potts · Thomas Icard +1
Neighborhood — ranked by edge-count
Hypotheses (1)
hypothesis
- Tested in Section 4.4 calibration experiment; confirmed by findings.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Demonstrates DAS cannot manufacture behaviors from random structure in appropriately sized networks.
- From Klein & Hoel (2020) analysis of artificial complex networks.
- DAS achieves 100% IIA on hierarchical equality task with |N|=16, intervention size 8, Layer 1finding0.732DAS discovers a perfect alignment between the feed-forward network and the Both Equality Relations high-level model.
- DAS runtime is invariant with number of testing hypotheses, unlike brute-force search.
- Key geometry-to-behavior bridge finding in E3; robust to pooling choice, cosine vs. L2, and frozen external encoder
- Methodological limitation disproportionately affecting the largest MoE model, constraining generalizability.
- DAS behavioral loss produces EMD along feature dimensions of 0.032±0.003 on synthetic 10-class datasetfinding0.721Quantitative baseline for divergence using behavioral DAS loss on synthetic dataset
- DAS behavioral loss achieves IIA of 0.997±0.001 on synthetic 10-class dataset training/test setsfinding0.717IIA baseline for DAS behavioral loss on synthetic dataset