Gradient conflict

When gradients of different tasks have negative cosine similarity, harming multi-task learning.

Neighborhood — ranked by edge-count

paper

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Gradientsconcept0.813
The property that qualities vary slowly, subtly, gradually across the extent of each living thing; gradients arise as natural responses to changing circumstances and create field-like character that points toward and establishes centers
Gradient Descentmethod0.810
Used for updating hidden state expectations; provides dynamical process theory testable against neuronal data
Preference Conflictconcept0.800
Key element for alignment faking: model's pre-existing preferences contradict the new training objective
Gradient methodmethod0.800
Optimization technique that computes weight changes by following the gradient of an error function; contrasted with evolutionary stochastic search.
Gradient Dilution Issuefinding0.784
During RL training on ATLAS, sparse functional tokens (2.3% of sequences) receive diluted gradient signals from sequence-level advantages propagated across all tokens.
Gradient-based data attributionmethod0.782
Baseline method against which probe-based ranking is compared; more computationally expensive.
gradient-magnitude balancingconcept0.781
Addressing disparity in gradient magnitudes across tasks at the gradient level
Positional information gradientconcept0.767
Gradient that tells a cell its correct position; stress arises from deviation from this gradient.