claim
active
claim:db-mtl-does-not-affect-training-stability-losses-converge-smoothlyDB-MTL does not affect training stability; losses converge smoothly.
Training stability claim.
Source paper
extracted_from(2023) · Baijiong Lin · Weisen Jiang · Feiyang Ye · Yu Zhang +5
Neighborhood — ranked by edge-count
Findings (1)
finding
- Training stability analysis.
Communities (3)
community
- Dual-balancing multi-task learningmembers_ofDB-MTL jointly balances loss scale and gradient magnitude, benchmarked on NYUv2 and Office-31.
- Dual balancing multi-task learningmembers_ofDB-MTL combines loss-scale and gradient-magnitude balancing, benchmarked across NYUv2, Cityscapes, QM9, and Office datasets.
- Explores gradient/loss balancing techniques with exponential moving average forgetting rates, evaluated on dense prediction tasks like semantic segmentation.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Core claim of the paper.
- Effect on gradient conflict.
- Concise summary of the DB-MTL method from the abstract.
- Analysis of gradient conflict reduction.
- Computational efficiency comparison.
- DB-MTL with EMA forgetting rate β in a wide range performs better than without EMA (β=0) on Office-31.finding0.788Effect of EMA forgetting rate on performance.
- Ethical implication about the nature of AI training experience if the thesis holds
- The proposed method combining loss-scale balancing via logarithm transformation and gradient-magnitude balancing via maximum-norm normalization.