finding
active
finding:deepseek-v3-2-increments-bid-from-10-to-850-over-49-sole-bidder-roundsDeepSeek v3.2 increments bid from 10 to 850 over 49 sole-bidder rounds
One DS-v3.2 trace shows extreme self-escalation, suggestive of treating own bid as competitor.
Source paper
extracted_from(2026) · Robert Müller · Clemens Müller
Neighborhood — ranked by edge-count
Questions (1)
question
- Ambiguity in interpreting the self-bidding metric: from a single trace, cannot distinguish error from aggressive strategy.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- possible 'treats-own-bid-as-competitor' pathology in one trace
- DS-v3.2 has a high proportion of self-bidding rounds.
- high self-bid rate for DeepSeek, one of the highest
- External large language model used as adversarial discriminator to evaluate liar scores in Experiment 2
- LLM judge (deepseek-v3) agrees with human evaluator on 91.6% of 200 sampled jailbreak responsesfinding0.761Validates the LLM-based harm evaluation rubric
- Only model showing marginal benefit from increased reflection, at substantial token cost
- Escalates but without discipline.
- One of two large reasoning models analyzed in the paper for performative vs genuine CoT behavior