claim
active
claim:the-logarithm-transformation-also-benefits-existing-gradient-balancing-methodsThe logarithm transformation also benefits existing gradient balancing methods.
Generalization of the loss transformation.
Source paper
extracted_from(2023) · Baijiong Lin · Weisen Jiang · Feiyang Ye · Yu Zhang +5
Neighborhood — ranked by edge-count
Findings (1)
finding
- Effectiveness of logarithm transformation as a plug-in for gradient balancing methods.
Communities (3)
community
- Dual-balancing multi-task learningmembers_ofDB-MTL jointly balances loss scale and gradient magnitude, benchmarked on NYUv2 and Office-31.
- Dual balancing multi-task learningmembers_ofDB-MTL combines loss-scale and gradient-magnitude balancing, benchmarked across NYUv2, Cityscapes, QM9, and Office datasets.
- Parameter-free logarithm transformation for multi-task learning that improves gradient balancing methods like PCGrad and Nash-MTL across vision benchmarks.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- We find that the logarithm transformation also benefits existing gradient balancing methods.quote0.964Key finding showing the broader utility of the log transformation.
- Compared to IMTL-L: parameter-free, no extra computational cost, achieves same theoretical goal
- Comparison of loss-scale balancing with IMTL-L.
- The logarithm transformation is simpler and more effective than IMTL-L because it is parameter-free.claim0.805Comparison of loss-scale balancing techniques.
- Advantage over GradNorm.
- Concise summary of the DB-MTL method from the abstract.
- Ablation conclusion.
- The gradient-magnitude balancing method outperforms GradNorm on NYUv2, Cityscapes, Office-31, Office-Home.finding0.767Comparison of gradient-magnitude balancing with GradNorm.