finding
active
finding:claude-haiku-4-5-and-economyagent-average-fewer-than-1-7-quartets-per-gameClaude Haiku 4.5 and EconomyAgent average fewer than 1.7 quartets per game
Weak agents complete very few quartets, correlating with low scores.
Source paper
extracted_from(2026) · Robert Müller · Clemens Müller
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- high quartet completion rate
- Haiku's overbid frequency is second highest after G2.5-FL.
- Very low win rate against code agents.
- Full evolver-side SWE results showing comparable performance across Claude family tiers
- Much higher cost per quartet due to waste.
- G3-F completed on average 3.96 quartets per game.
- These two LLMs bargain with minimal overpayment but low overall efficiency.
- Linked to Claude 3.5 Sonnet not exhibiting pro-animal-welfare preferences