finding
active
finding:harness-updating-gain-spread-is-at-most-3-1-percentage-points-across-all-evolvers-on-any-single-benchmark

Harness-updating gain spread is at most 3.1 percentage points across all evolvers on any single benchmark

Core finding that harness-updating capability does not scale with model base capability

Source paper

extracted_from
Harness Updating Is Not Harness Benefit: Disentangling Evolution Capabilities in Self-Evolving LLM Agents
(2026) · Minhua Lin · Juncheng Wu · Zijun Wang · Zhan Shi +13

Neighborhood — ranked by edge-count

Questions (1)

question

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.