claim
active
claim:loss-scale-balancing-and-gradient-magnitude-balancing-are-complementary-and-combining-them-achieves-the-best-performanceLoss-scale balancing and gradient-magnitude balancing are complementary and combining them achieves the best performance.
Ablation conclusion.
Source paper
extracted_from(2023) · Baijiong Lin · Weisen Jiang · Feiyang Ye · Yu Zhang +5
Neighborhood — ranked by edge-count
Findings (1)
finding
- Full DB-MTL ablation result.
Communities (3)
community
- Dual-balancing multi-task learningmembers_ofDB-MTL jointly balances loss scale and gradient magnitude, benchmarked on NYUv2 and Office-31.
- Dual balancing multi-task learningmembers_ofDB-MTL combines loss-scale and gradient-magnitude balancing, benchmarked across NYUv2, Cityscapes, QM9, and Office datasets.
- Multi-task learning gradient balancingmembers_ofTechniques for combining loss-scale and gradient-magnitude weighting to improve multi-task dense prediction on NYUv2 benchmark.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Task balancing requires simultaneous consideration of both loss scales and gradient magnitudesclaim0.881Core interpretive position of DB-MTL: complementarity of loss and gradient perspectives
- Advantage over GradNorm.
- Concise summary of the DB-MTL method from the abstract.
- Addressing disparity in gradient magnitudes across tasks at the gradient level
- The gradient-magnitude balancing method outperforms GradNorm on NYUv2, Cityscapes, Office-31, Office-Home.finding0.808Comparison of gradient-magnitude balancing with GradNorm.
- Addressing disparity in loss magnitudes across tasks at the loss level
- Setting aggregated gradient scaling factor to maximum gradient norm performs best for task balancingclaim0.796Empirical finding on choice of αk in gradient normalization strategy
- Core claim of the paper.