finding
active
finding:best-localist-alignment-achieves-iia-of-0-73-on-hierarchical-equality-both-equality-relations-in-layer-1Best localist alignment achieves IIA of 0.73 on hierarchical equality Both Equality Relations in Layer 1
Shows localist alignment fails to capture the distributed structure found by DAS.
Source paper
extracted_from(2023) · Atticus Geiger · Zhengxuan Wu · Christopher Potts · Thomas Icard +1
Neighborhood — ranked by edge-count
Claims (1)
claim
- Central claim motivating DAS over prior methods.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Localist methods fail entirely on MoNLI distributed representations.
- Exception to the general trend; attributed to insufficient RevNet capacity rather than algorithm not being implemented
- Replicates Geiger et al. 2024b pattern of layer-dependent IIA degradation with linear maps
- Brute-force search achieves best IIA of 0.60 on hierarchical equality Both Equality Relations in Layer 1finding0.795DAS substantially outperforms brute-force search (1.00 vs 0.60 IIA) on the hierarchical equality task.
- Baseline that finds the axis-aligned orthogonal matrix closest to the learned distributed rotation, assuming disjoint neuron groups.
- Demonstrates that high IIA can be obtained even when model cannot solve the task
- Key empirical result: non-linear maps overcome linear maps' failure in deeper layers
- DAS achieves 100% IIA on hierarchical equality task with |N|=16, intervention size 8, Layer 1finding0.770DAS discovers a perfect alignment between the feed-forward network and the Both Equality Relations high-level model.