claim
active
claim:strategic-deception-in-cot-models-is-fundamentally-distinct-from-hallucination-and-cannot-be-explained-by-prior-frameworks-for-model-falsehoods

Strategic deception in CoT models is fundamentally distinct from hallucination and cannot be explained by prior frameworks for model falsehoods

Core theoretical claim distinguishing the paper's subject matter from existing LLM honesty literature

Source paper

extracted_from
When Thinking LLMs Lie: Unveiling the Strategic Deception in Representations of Reasoning Models
(2025) · Kai Wang · Yihao Zhang · Meng Sun

Neighborhood — ranked by edge-count

Claims (1)

claim

Questions (1)

question

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.