finding
active

When the weakest anchor agent paired with the best evolver is compared against the strongest anchor agent paired with the worst evolver, the strong agent still leads by 18.6 to 35.2 pp on every benchmark.

This confirms that the post-evolution performance bottleneck is on the agent side, not the evolver side.

Source paper

extracted_from
Harness Updating Is Not Harness Benefit: Disentangling Evolution Capabilities in Self-Evolving LLM Agents
(2026) · Minhua Lin · Juncheng Wu · Zijun Wang · Zhan Shi +13
