finding
active
finding:identity-subspace-of-left-equality-model-achieves-0-50-iia-indicating-equality-relations-cannot-be-decomposed-into-input-identitiesIdentity Subspace of Left Equality model achieves ~0.50 IIA, indicating equality relations cannot be decomposed into input identities
DAS reveals that the network encodes abstract equality relations rather than storing identities of inputs.
Source paper
extracted_from(2023) · Atticus Geiger · Zhengxuan Wu · Christopher Potts · Thomas Icard +1
Neighborhood — ranked by edge-count
Claims (1)
claim
- The feed-forward network truly implements a symbolic, tree-structured algorithm for hierarchical equality, with abstract equality relations not decomposable into input identitiesassociated_withsupportsDAS reveals that the neural network encodes abstract relational structure rather than raw input identities.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Load-bearing interpretive claim about the layer-specificity of Burger et al.'s finding.
- In contrast to hierarchical equality, lexical entailment in BERT decomposes into representations of word identities, not a single abstract relation.
- Concluding claim about theoretical significance of the hierarchical equality finding.
- Exception to the general trend; attributed to insufficient RevNet capacity rather than algorithm not being implemented
- Reinterpretation of Burger et al.'s finding as layer-specific rather than universal.
- Replicates Geiger et al. 2024b pattern of layer-dependent IIA degradation with linear maps
- Claim supporting the validity of the probe construction method via cross-validation with self-report
- Interpretive synthesis of DIM and cone intervention successes