method
active
method:harness-updating-gain-updateHarness-Updating Gain (Δupdate)
Metric measuring harness-updating capability as the mean pairwise gain across an anchor agent set
Neighborhood — ranked by edge-count
Papers (1)
paper
Concepts (1)
concept
- Anchor Agent Setassociated_withFixed set of representative task-solving agents (Opus 4.6, Sonnet 4.6, Qwen3-235B) used to compute harness-updating capability metrics
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Metric measuring harness-benefit capability as the maximum pairwise gain across a fixed anchor evolver set
- The capability of an evolver model to produce useful persistent harness updates from execution evidence
- Verbatim summary of first major finding from conclusion
- First major claim of the paper, supported by narrow spread across evolvers and case study
- Harness-updating gain spread is at most 3.1 percentage points across all evolvers on any single benchmarkfinding0.765Core finding that harness-updating capability does not scale with model base capability
- First open question the paper sets out to answer through evolver-side analysis
- Second open question the paper sets out to answer through agent-side analysis
- The capability of a task-solving agent to benefit from updated harnesses during task solving