finding
active
finding:db-mtl-increases-gradient-cosine-similarity-faster-and-keeps-it-positive-on-office-31-reducing-gradient-conflict-vs-ewDB-MTL increases gradient cosine similarity faster and keeps it positive on Office-31, reducing gradient conflict vs EW.
Analysis of gradient conflict reduction.
Source paper
extracted_from(2023) · Baijiong Lin · Weisen Jiang · Feiyang Ye · Yu Zhang +5
Neighborhood — ranked by edge-count
Claims (1)
claim
- Effect on gradient conflict.
Communities (3)
community
- Dual-balancing multi-task learningmembers_ofDB-MTL jointly balances loss scale and gradient magnitude, benchmarked on NYUv2 and Office-31.
- Dual balancing multi-task learningmembers_ofDB-MTL combines loss-scale and gradient-magnitude balancing, benchmarked across NYUv2, Cityscapes, QM9, and Office datasets.
- Dynamic balancing methods that increase gradient alignment and reduce task interference, evaluated on Office-31 domain adaptation.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Core claim of the paper.
- Concise summary of the DB-MTL method from the abstract.
- Computational efficiency comparison.
- Training stability claim.
- DB-MTL with EMA forgetting rate β in a wide range performs better than without EMA (β=0) on Office-31.finding0.804Effect of EMA forgetting rate on performance.
- Training stability analysis.
- Advantage over GradNorm.
- Observation illustrating the task balancing problem on NYUv2.