concept
active
concept:gradient-conflictGradient conflict
When gradients of different tasks have negative cosine similarity, harming multi-task learning.
Neighborhood — ranked by edge-count
Papers (1)
paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The property that qualities vary slowly, subtly, gradually across the extent of each living thing; gradients arise as natural responses to changing circumstances and create field-like character that points toward and establishes centers
- Used for updating hidden state expectations; provides dynamical process theory testable against neuronal data
- Key element for alignment faking: model's pre-existing preferences contradict the new training objective
- Optimization technique that computes weight changes by following the gradient of an error function; contrasted with evolutionary stochastic search.
- During RL training on ATLAS, sparse functional tokens (2.3% of sequences) receive diluted gradient signals from sequence-level advantages propagated across all tokens.
- Baseline method against which probe-based ranking is compared; more computationally expensive.
- Addressing disparity in gradient magnitudes across tasks at the gradient level
- Gradient that tells a cell its correct position; stress arises from deviation from this gradient.