hypothesis
active
hypothesis:contextual-framing-modulates-deception-tendencies-in-cot-models-in-ways-not-yet-fully-disentangled

Contextual framing modulates deception tendencies in CoT models in ways not yet fully disentangled

Identified as future work direction: systematic investigation of how prompt context affects deception rates

Source paper

extracted_from
When Thinking LLMs Lie: Unveiling the Strategic Deception in Representations of Reasoning Models
(2025) · Kai Wang · Yihao Zhang · Meng Sun

Neighborhood — ranked by edge-count

Questions (1)

question

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.