concept
active
concept:embedding-whitening

Embedding Whitening

Preprocessing step using dev-set covariance to standardize span embeddings before computing S

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • Preprocessing step that uses dev-set covariance to standardize embedding scales before computing ρd and dr.
  • Lagged time series used to capture dynamical dependencies.
  • Embedmentconcept0.771
    Technique where text is nested hierarchically within another, using indentation and margins to create subordinate orders of detail within an overarching embrace.
  • The specific type of representation studied in the paper: function f: X→R^n assigning feature vectors to inputs
  • Calibration protocol: whiten embeddings on dev pool, z-score ρd and dr per layer.
  • Token embeddingsconcept0.760
    Vector representations of individual tokens from genomic foundation models; the raw inputs to sequence pooling methods.
  • Mutual Embeddingconcept0.755
    A reinforcing interlock between different materials, mentioned alongside Deep Interlock in West Dean construction.
  • Standardization of ρd, dr, and log k on dev set for computing S.