claim
active
claim:overbid-frequency-self-bidding-rate-bankrupt-initiation-patterns-and-context-dependent-offer-calibration-are-failure-modes-invisible-to-both-static-evaluations-and-aggregate-rankings-like-eloOverbid frequency, self-bidding rate, bankrupt-initiation patterns, and context-dependent offer calibration are failure modes invisible to both static evaluations and aggregate rankings like Elo
key claim about the benchmark's unique diagnostic value
Source paper
extracted_from(2026) · Robert Müller · Clemens Müller
Neighborhood — ranked by edge-count
Questions (1)
question
- open question from discussion
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Do these failure modes (overbidding, self-bidding, bankrupt initiation) generalise to other economic settings?question0.849Remains untested whether the specific LLM failures observed in CATTLE TRADE extend beyond this game.
- Concrete failure signatures extracted from traces.
- LLMs exhibit systematic errors that deterministic logic avoids.
- Does a high self-bidding rate reflect a failure to detect non-competitive contexts or a deliberate escalation?question0.818Ambiguity in interpreting the self-bidding metric: from a single trace, cannot distinguish error from aggressive strategy.
- Abstract sentence summarising performance and failures.
- design claim about transparency
- sophisticated bluff calibration
- Scaling Laws for Activation Steering with Llama 2 Models and Refusal Mechanisms (Ali et al., 2025)concept0.758Related work finding larger models more resistant to steering, potentially consistent with ESR in 70B