finding
active
finding:das-achieves-100-iia-on-hierarchical-equality-task-with-n-16-intervention-size-8-layer-1
DAS achieves 100% IIA on hierarchical equality task with |N|=16, intervention size 8, Layer 1
DAS discovers a perfect alignment between the feed-forward network and the Both Equality Relations high-level model.
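For readers unfamiliar with the mechanics behind this finding, the following is a minimal sketch of a distributed interchange intervention and of the interchange intervention accuracy (IIA) metric. It makes several assumptions: that |N|=16 refers to the hidden representation size and that an intervention size of 8 means 8 rotated coordinates are swapped; that a fixed Householder reflection can stand in for the orthogonal matrix (DAS learns this matrix by gradient descent, which is not shown); and that random vectors can stand in for Layer 1 activations. All function and variable names are hypothetical and are not taken from the source paper's code.

```python
import random

def householder(v):
    """Return the orthogonal matrix I - 2 v v^T / (v^T v) as nested lists."""
    n = len(v)
    norm_sq = sum(x * x for x in v)
    return [[(1.0 if i == j else 0.0) - 2.0 * v[i] * v[j] / norm_sq
             for j in range(n)] for i in range(n)]

def matvec(m, x):
    """Multiply matrix m (nested lists) by vector x."""
    return [sum(row[j] * x[j] for j in range(len(x))) for row in m]

def transpose(m):
    return [list(col) for col in zip(*m)]

def distributed_interchange(base, source, rotation, k):
    """Rotate both hidden vectors, take the first k rotated coordinates
    from the source run and the rest from the base run, then rotate back."""
    rotated_base = matvec(rotation, base)
    rotated_source = matvec(rotation, source)
    mixed = rotated_source[:k] + rotated_base[k:]
    # For an orthogonal matrix, the inverse is the transpose.
    return matvec(transpose(rotation), mixed)

def iia(predictions, counterfactual_labels):
    """Fraction of intervened predictions that match the label the
    high-level model assigns under the same intervention."""
    matches = sum(p == y for p, y in zip(predictions, counterfactual_labels))
    return matches / len(counterfactual_labels)

random.seed(0)
HIDDEN, K = 16, 8  # assumed reading of |N|=16 and intervention size 8

# Fixed stand-in for the learned rotation (assumption: DAS would train this).
R = householder([random.gauss(0, 1) for _ in range(HIDDEN)])

# Random stand-ins for Layer 1 activations on a base and a source input.
base = [random.gauss(0, 1) for _ in range(HIDDEN)]
source = [random.gauss(0, 1) for _ in range(HIDDEN)]

patched = distributed_interchange(base, source, R, K)
```

In a full pipeline, `patched` would replace the base run's hidden state before the remaining layers are executed, and `iia` would compare the resulting predictions with the high-level model's counterfactual outputs. An IIA of 1.0 corresponds to the perfect alignment reported above.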
Source paper
extracted_from · (2023) · Atticus Geiger · Zhengxuan Wu · Christopher Potts · Thomas Icard +1
Neighborhood — ranked by edge-count
Claims (3)
claim
- Central claim motivating DAS over prior methods.
- DAS reveals that the neural network encodes abstract relational structure rather than raw input identities.
- Concluding claim about theoretical significance of the hierarchical equality finding.
Questions (1)
question
- Specific research question for the first experiment.
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- DAS runtime is invariant with number of testing hypotheses, unlike brute-force search.
- Brute-force search achieves best IIA of 0.60 on hierarchical equality Both Equality Relations in Layer 1 · finding · 0.803 · DAS substantially outperforms brute-force search (1.00 vs 0.60 IIA) on the hierarchical equality task.
- Perfect abstraction relation between BERT and symbolic algorithm with negation and lexical entailment variables.
- Demonstrates that high IIA can be obtained even when the model cannot solve the task.
- Best localist alignment achieves IIA of 0.73 on hierarchical equality Both Equality Relations in Layer 1 · finding · 0.770 · Shows localist alignment fails to capture the distributed structure found by DAS.
- Interpretive claim from Case Study II about the distinction between correlational probes and causal interventions
- DAS behavioral loss achieves IIA of 0.997±0.001 on synthetic 10-class dataset training/test sets · finding · 0.767 · IIA baseline for DAS behavioral loss on synthetic dataset.
- Case Study II result showing DAS identifies fewer causally relevant positions than a probe