claim
active
claim:the-feed-forward-network-truly-implements-a-symbolic-tree-structured-algorithm-for-hierarchical-equality-with-abstract-equality-relations-not-decomposable-into-input-identitiesThe feed-forward network truly implements a symbolic, tree-structured algorithm for hierarchical equality, with abstract equality relations not decomposable into input identities
DAS reveals that the neural network encodes abstract relational structure rather than raw input identities.
Source paper
extracted_from(2023) · Atticus Geiger · Zhengxuan Wu · Christopher Potts · Thomas Icard +1
Neighborhood — ranked by edge-count
Papers (1)
paper
Findings (2)
finding
- Identity Subspace of Left Equality model achieves ~0.50 IIA, indicating equality relations cannot be decomposed into input identitiesassociated_withsupportsDAS reveals that the network encodes abstract equality relations rather than storing identities of inputs.
- DAS achieves 100% IIA on hierarchical equality task with |N|=16, intervention size 8, Layer 1supportsDAS discovers a perfect alignment between the feed-forward network and the Both Equality Relations high-level model.
Claims (1)
claim
- Key asymmetry between hierarchical equality and NLI experiments; BERT stores identities rather than the abstract relation.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Does the hierarchical equality network implement a program that computes w=x and y=z as intermediate values?question0.799Specific research question for the first experiment.
- Central theoretical result proving unrestricted causal abstraction is trivial
- Architectural requirement from machine learning.
- Concluding claim about theoretical significance of the hierarchical equality finding.
- Can an interpretable symbolic algorithm be used to faithfully explain a complex neural network model?question0.750Framing question for the paper's research program.
- Interpretation that Friedman’s use of Baran’s distributed networks is a political statement for a non-paternalistic society.
- Explains why biological systems achieve organization across scales while language models struggle; grounds in free energy scaling