claim
active
claim:what-appears-to-be-a-representation-of-lexical-entailment-in-bert-is-actually-a-data-structure-of-two-word-identity-representations-not-an-encoding-of-the-entailment-relation

What appears to be a representation of lexical entailment in BERT is actually a data structure of two word identity representations, not an encoding of the entailment relation

Key asymmetry between hierarchical equality and NLI experiments; BERT stores identities rather than the abstract relation.

Source paper

extracted_from
Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations
(2023) · Atticus Geiger · Zhengxuan Wu · Christopher Potts · Thomas Icard +1

Neighborhood — ranked by edge-count

Findings (1)

finding

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.