concept
active
concept:gradient-conflict

Gradient conflict

When gradients of different tasks have negative cosine similarity, harming multi-task learning.

Neighborhood — ranked by edge-count

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • Gradientsconcept0.813
    The property that qualities vary slowly, subtly, gradually across the extent of each living thing; gradients arise as natural responses to changing circumstances and create field-like character that points toward and establishes centers
  • Gradient Descentmethod0.810
    Used for updating hidden state expectations; provides dynamical process theory testable against neuronal data
  • Key element for alignment faking: model's pre-existing preferences contradict the new training objective
  • Gradient methodmethod0.800
    Optimization technique that computes weight changes by following the gradient of an error function; contrasted with evolutionary stochastic search.
  • During RL training on ATLAS, sparse functional tokens (2.3% of sequences) receive diluted gradient signals from sequence-level advantages propagated across all tokens.
  • Baseline method against which probe-based ranking is compared; more computationally expensive.
  • Addressing disparity in gradient magnitudes across tasks at the gradient level
  • Gradient that tells a cell its correct position; stress arises from deviation from this gradient.