claim

active

claim:behavioural-traces-surface-recurring-llm-failure-modes-including-overbidding-self-bidding-bankrupt-tc-initiation-and-weak-opponent-state-adaptation-that-never-appear-in-code-agents

Behavioural traces surface recurring LLM failure modes including overbidding, self-bidding, bankrupt TC initiation, and weak opponent-state adaptation that never appear in code agents.

LLMs exhibit systematic errors that deterministic logic avoids.

Source paper

extracted_from

Cattle Trade: A Multi-Agent Benchmark for LLM Bluffing, Bidding, and Bargaining

(2026) · Robert Müller · Clemens Müller

Neighborhood — ranked by edge-count

Findings (6)

finding

Gemini 2.5 Flash Lite self-bidding rate 78.5%
supports
G2.5-FL raises its own bid in over three-quarters of auction rounds.
The three code agents never overbid
supports
Deterministic heuristics avoid the overbidding failure mode entirely.
G2.5-FL overbid rate=1.20%, highest among all agents
supports
highest overbid frequency observed
G2.5-FL repeatedly initiates TCs after depleting money through overbidding
supports
failure to condition action choice on resource state
G2.5-FL self-bid rate=78.5%
supports
highest self-bid rate among all agents
Gemini 2.5 Flash Lite overbid rate 1.20%
supports
G2.5-FL has the highest overbid frequency among all agents.
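
The rates cited in these findings are trace-level statistics. As a minimal sketch of how such metrics might be computed, the Python below assumes a hypothetical trace layout: each auction round is an ordered list of `Bid` records, and each TC initiation is an `(agent, money)` pair. The definitions used here are assumptions, not the paper's: an overbid is a bid larger than the bidder's money, a self-bid is a bid placed directly on top of the agent's own standing bid, and the self-bid denominator is the set of rounds in which the agent bid at least once.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Bid:
    agent: str   # bidder identifier
    amount: int  # amount bid
    money: int   # bidder's money when bidding (hypothetical trace field)


def overbid_rate(rounds, agent):
    """Share of the agent's bids that exceed its money (assumed definition)."""
    own = [b for r in rounds for b in r if b.agent == agent]
    return sum(b.amount > b.money for b in own) / len(own) if own else 0.0


def self_bid_rate(rounds, agent):
    """Share of rounds the agent bid in where it raised its own standing bid."""
    joined = [r for r in rounds if any(b.agent == agent for b in r)]
    hits = sum(
        # Two consecutive bids by the same agent: it outbid itself.
        any(a.agent == agent and b.agent == agent for a, b in zip(r, r[1:]))
        for r in joined
    )
    return hits / len(joined) if joined else 0.0


def bankrupt_tc_initiations(tc_events, agent):
    """Count TCs the agent initiated while holding no money."""
    return sum(1 for who, money in tc_events if who == agent and money == 0)


# Toy trace, invented for illustration only (not data from the paper).
rounds = [
    [Bid("A", 10, 50), Bid("A", 20, 50), Bid("B", 30, 40)],  # A raises itself
    [Bid("A", 10, 50), Bid("B", 20, 40), Bid("A", 60, 50)],  # A bids 60 with 50
    [Bid("B", 10, 40)],                                      # A sits out
]
tc_events = [("A", 0), ("A", 30), ("B", 0)]

print(self_bid_rate(rounds, "A"))
print(overbid_rate(rounds, "A"))
print(bankrupt_tc_initiations(tc_events, "A"))
```

Changing the denominator (all rounds rather than rounds the agent joined, or bids rather than rounds) would change the resulting rates, so any comparison against the reported figures depends on matching the paper's own definitions.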

Questions (2)

question

Does a high self-bidding rate reflect a failure to detect non-competitive contexts or a deliberate escalation?
gates
Ambiguity in interpreting the self-bidding metric: a single trace cannot distinguish an error from an aggressive strategy.
Do these failure modes (overbidding, self-bidding, bankrupt initiation) generalise to other economic settings?
gates
It remains untested whether the specific LLM failures observed in CATTLE TRADE extend beyond this game.

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Two heuristic code agents outperform most tested LLMs, and behavioural traces surface recurring LLM failure modes including overbidding, self-bidding, bankrupt TC initiation, and weak opponent-state adaptation. · quote · 0.917
Abstract sentence summarising performance and failures.
Overbid frequency, self-bidding rate, bankrupt-initiation patterns, and context-dependent offer calibration are failure modes invisible to both static evaluations and aggregate rankings like Elo · claim · 0.835
key claim about the benchmark's unique diagnostic value
Overbidding, self-bidding spirals, and undisciplined bluffing characterise failure. · claim · 0.796
Concrete failure signatures extracted from traces.
Baseline LLM condition in IPD replicates prior findings: agents cooperate selectively only when opponent consistently cooperates · finding · 0.782
Replication of Fontana et al. 2025 findings in the paper's own Experiment 2 baseline condition
Code agents operate on structured data with exact arithmetic, while LLMs must parse natural-language observations and track state across turns; some failures may partly reflect numerical parsing or working-memory limitations · claim · 0.772
discussion of potential confounds
Do LLM failures in CATTLE TRADE reflect genuinely hard strategic problems or errors that novice humans also avoid? · question · 0.772
Open question about benchmarking against human players to calibrate difficulty.
Sequences of contemporary Transformer-based LLM representations lack statistically significant indicators of observed 'consciousness' phenomena under the three stringent criteria. · claim · 0.766
Primary conclusion of the study based on temporal permutation analysis failing all three criteria.
Conditional logic already suffices where LLMs still fail, as code agents avoid systematic failures · claim · 0.766
contrast between rule-based and LLM reasoning