quote
active
quote:db-mtl-achieves-loss-scale-balancing-by-performing-logarithm-transformation-on-each-task-loss-and-rescales-gradient-magnitudes-by-normalizing-all-task-gradients-to-comparable-magnitudes-using-the-maximum-gradient-normDB-MTL achieves loss-scale balancing by performing logarithm transformation on each task loss, and rescales gradient magnitudes by normalizing all task gradients to comparable magnitudes using the maximum gradient norm.
Concise summary of the DB-MTL method from the abstract.
Source paper
extracted_from(2023) · Baijiong Lin · Weisen Jiang · Feiyang Ye · Yu Zhang +5
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Core claim of the paper.
- Effect on gradient conflict.
- Ablation conclusion.
- Computational efficiency comparison.
- Comparison of loss-scale balancing with IMTL-L.
- Task balancing requires simultaneous consideration of both loss scales and gradient magnitudesclaim0.829Core interpretive position of DB-MTL: complementarity of loss and gradient perspectives
- Analysis of gradient conflict reduction.
- Training stability claim.