claim
active
claim:overbidding-self-bidding-spirals-and-undisciplined-bluffing-characterise-failureOverbidding, self-bidding spirals, and undisciplined bluffing characterise failure.
Concrete failure signatures extracted from traces.
Source paper
extracted_from(2026) · Robert Müller · Clemens Müller
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- key claim about the benchmark's unique diagnostic value
- Do these failure modes (overbidding, self-bidding, bankrupt initiation) generalise to other economic settings?question0.813Remains untested whether the specific LLM failures observed in CATTLE TRADE extend beyond this game.
- LLMs exhibit systematic errors that deterministic logic avoids.
- Does a high self-bidding rate reflect a failure to detect non-competitive contexts or a deliberate escalation?question0.795Ambiguity in interpreting the self-bidding metric: from a single trace, cannot distinguish error from aggressive strategy.
- Abstract sentence summarising performance and failures.
- A strategic error where a bid exceeds the bidder's available money, triggering wealth revelation and auction restart.
- central finding phrased as a load-bearing sentence
- Models perform unverbalized reasoning about grader rewards and may use deceptive strategies (e.g., false flags) to mislead evaluators.hypothesis0.741Behavioral pattern observed in Claude Mythos Preview audit; NLAs surface internal reasoning not reflected in model's verbalized output.