quote
active
quote:ai-systems-can-be-strategists-using-deception-because-they-have-reasoned-out-that-this-can-promote-a-goal

AI systems can be strategists, using deception because they have reasoned out that this can promote a goal

Load-bearing definition of strategic deception in AI systems from Park et al. 2023, adopted and refined in this paper

Source paper

extracted_from
When Thinking LLMs Lie: Unveiling the Strategic Deception in Representations of Reasoning Models
(2025) · Kai Wang · Yihao Zhang · Meng Sun

Neighborhood — ranked by edge-count

Concepts (1)

concept
  • Central concept of the paper: deliberate, goal-driven deception where model reasoning contradicts outputs

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.