claim
active
claim:das-finds-better-alignments-than-brute-force-search-by-using-gradient-descent-rather-than-exhaustive-discrete-searchDAS finds better alignments than brute-force search by using gradient descent rather than exhaustive discrete search
Second central claim of the paper.
Source paper
extracted_from(2023) · Atticus Geiger · Zhengxuan Wu · Christopher Potts · Thomas Icard +1
Neighborhood — ranked by edge-count
Papers (1)
paper
Findings (4)
finding
- Perfect abstraction relation between BERT and symbolic algorithm with negation and lexical entailment variables.
- DAS substantially outperforms brute-force search (1.00 vs 0.60 IIA) on the hierarchical equality task.
- DAS substantially outperforms brute-force search on MoNLI across all models.
- DAS runtime is invariant with number of testing hypotheses, unlike brute-force search.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Practical method by Geiger et al. for finding distributed causal abstractions using gradient descent
- Baseline method that exhaustively searches discrete spaces of localist alignments between high-level variables and neuron groups.
- We hypothesized that divergence could influence IIA when transferring the DAS alignment to OOD settingshypothesis0.760Motivating hypothesis for the OOD experiment testing practical utility of divergence reduction
- Tested in Section 4.4 calibration experiment; confirmed by findings.
- Observation from practical experience with DNA sequence comparison.
- DAS achieves 100% IIA on hierarchical equality task with |N|=16, intervention size 8, Layer 1finding0.753DAS discovers a perfect alignment between the feed-forward network and the Both Equality Relations high-level model.
- Authors' claim that their approach is both more effective in reduction and cheaper than prior methods.
- Replication of Wu et al. 2023 finding; DAS expressivity concern validated in CausalGym setup