finding
active
finding:claude-haiku-4-5-and-gpt-5-4-nano-have-tc-tightness-0-4-the-tightest-among-allClaude Haiku 4.5 and GPT-5.4 Nano have TC tightness τ ≈ 0.4, the tightest among all
These two LLMs bargain with minimal overpayment but low overall efficiency.
Source paper
extracted_from(2026) · Robert Müller · Clemens Müller
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Code agents trade bargaining precision for acquisition pressure.
- GPT-5.4 Nano TrueSkill rating
- Full evolver-side SWE results showing comparable performance across Claude family tiers
- Haiku's overbid frequency is second highest after G2.5-FL.
- G3-F achieves a TC tightness of about 0.34, meaning moderate overpayment in won challenges.
- Shows SB low-base regime is more variable than SWE; Haiku benefits far more than Qwen3-235B despite similar base rates
- Weak agents complete very few quartets, correlating with low scores.
- GPT5.4-N also exhibits a high self-bidding propensity.