method
active
method:semantic-deduplication
Semantic Deduplication
Greedy pass that retains a text only if its cosine similarity to every already-retained text is below 0.9; used to keep the statement and SJT corpora diverse.
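The greedy pass described above can be sketched as follows. This is a minimal illustration, not the pipeline's actual code: it assumes each text has already been mapped to an embedding vector (the embedding model is not specified here), and the names `cosine` and `greedy_semantic_dedup` are hypothetical. Only the 0.9 threshold and the "compare against all retained texts" rule come from the description.

```python
import math
from typing import List, Sequence


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two vectors; returns 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def greedy_semantic_dedup(
    embeddings: Sequence[Sequence[float]],
    threshold: float = 0.9,  # threshold taken from the description above
) -> List[int]:
    """Return indices of retained items, in input order.

    An item is retained only if its cosine similarity to every
    previously retained item is strictly below `threshold`.
    """
    kept: List[int] = []
    for i, vec in enumerate(embeddings):
        if all(cosine(vec, embeddings[j]) < threshold for j in kept):
            kept.append(i)
    return kept


# Toy 2-D "embeddings" (assumed for illustration): the second vector is
# nearly parallel to the first, so it is dropped.
toy = [[1.0, 0.0], [0.99, 0.1], [0.0, 1.0], [0.7, 0.7]]
print(greedy_semantic_dedup(toy))
```

Because the pass is greedy, the result depends on input order: the first member of a near-duplicate cluster is the one that survives. The pairwise comparison is quadratic in the number of retained texts, which is usually acceptable for small corpora.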
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Central mechanism in denotational design: precise mathematical meaning assigned to types and expressions, independent of implementation.
- Formal modeling approach used throughout Fruit to provide mathematical meaning to GUI abstractions; enables reasoning about program properties.
- Meaning that arises from relations within the graphical system, not inherent in elements.
- The meaningful organization of concepts in a model's representation space, claimed to be better captured by manifolds than by SAEs.
- Denotation function µ decomposes over operations, so the meaning of a compound expression follows from the meanings of its parts.
- The central idea that external structure binds latent patterns to desired targets.
- The semantic relation between words wp and wh (entails/neutral) used as an intermediate variable in the MoNLI high-level model.