method
active
method:semantic-deduplication

Semantic Deduplication

Greedy pass retaining texts only if cosine similarity below 0.9 with all retained texts; used to maintain diverse statement and SJT corpora

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • Central mechanism in denotational design: precise mathematical meaning assigned to types and expressions, independent of implementation.
  • Formal modeling approach used throughout Fruit to provide mathematical meaning to GUI abstractions; enables reasoning about program properties.
  • semantic valueconcept0.752
    Meaning that arises from relations within the graphical system, not inherent in elements.
  • semantic structureconcept0.732
    The meaningful organization of concepts in a model's representation space, claimed to be better captured by manifolds than by SAEs.
  • Denotation function µ decomposes over operations so meaning of compound expressions follows from meanings of parts
  • semantic anchoringconcept0.719
    The central idea that external structure binds latent patterns to desired targets.
  • Lexical Entailmentconcept0.714
    The semantic relation between words wp and wh (entails/neutral) used as an intermediate variable in the MoNLI high-level model.