claim
active
claim:the-magnitude-of-the-normalized-gradients-choice-of-k-plays-an-important-role-in-performance
The magnitude of the normalized gradients (the choice of α_k) plays an important role in performance.
Insight about gradient normalization scaling.
Source paper
extracted_from · (2023) · Baijiong Lin · Weisen Jiang · Feiyang Ye · Yu Zhang +5
Neighborhood — ranked by edge-count
Findings (1)
finding
- Setting α_k to the maximum gradient norm performs best among the tested strategies on NYUv2 (Figure 6). · supports · Sensitivity analysis for the gradient normalization scaling factor.
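To make the finding above concrete, here is a minimal NumPy sketch of gradient normalization with a configurable scaling factor. It assumes each task gradient is available as a flattened 1-D array; the function name `normalize_and_sum`, the `scale` argument, and the "min" and "mean" options are hypothetical and included only for comparison. They are not necessarily the strategies tested in the paper, and this is not the authors' implementation.

```python
import numpy as np

def normalize_and_sum(task_grads, scale="max", eps=1e-8):
    """Rescale each task gradient to unit norm, sum them, and multiply by alpha.

    task_grads: list of 1-D arrays, one flattened gradient per task (assumed layout).
    scale: "max" sets alpha to the largest task-gradient norm; "min" and "mean"
           are illustrative alternatives, not taken from the source.
    """
    grads = np.stack(task_grads)              # shape (num_tasks, num_params)
    norms = np.linalg.norm(grads, axis=1)     # per-task L2 norms
    unit = grads / (norms[:, None] + eps)     # each row now has (near) unit norm
    alpha = {"max": norms.max(), "min": norms.min(), "mean": norms.mean()}[scale]
    return alpha * unit.sum(axis=0)           # aggregated update direction

# Two toy task gradients with norms 5 and 1.
g = normalize_and_sum([np.array([3.0, 4.0]), np.array([0.0, 1.0])], scale="max")
```

With `scale="max"`, every task contributes a direction of equal length, and the aggregate is scaled by the largest observed gradient norm, so the overall step size is not shrunk by tasks whose gradients are small.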
Communities (3)
community
- Dual-balancing multi-task learning · members_of · DB-MTL jointly balances loss scale and gradient magnitude, benchmarked on NYUv2 and Office-31.
- Dual balancing multi-task learning · members_of · DB-MTL combines loss-scale and gradient-magnitude balancing, benchmarked across NYUv2, Cityscapes, QM9, and Office datasets.
- Investigates optimal gradient balancing strategies across tasks, finding maximum gradient norm normalization outperforms alternatives in multitask optimization.
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Recommended strategy for gradient normalization.
- We hypothesize that degraded generalization on benchmarks like MMLU may reflect the computational demands of the tasks. · hypothesis · 0.762 · Connecting the paper's task-difficulty findings to prior observations of weak generalization on complex QA benchmarks.
- Advantage over GradNorm.
- Feature attribution (gradient-based) correlates 0.8 with ablation effects on the 'John' and 'Kobe' examples. · finding · 0.760 · Validation of attribution as a fast proxy for causal importance.
- Claim that one of the most powerful forms of life has been almost removed from the environment by industrial production norms
- Claim about broader applicability of the scaling argument
- Mathematical constraint showing that backpropagation requires signed information