claim
active
claim:the-role-play-framing-allows-us-to-meaningfully-distinguish-in-dialogue-agents-the-same-three-cases-of-giving-false-information-as-in-humans-without-anthropomorphism

The role-play framing allows us to meaningfully distinguish, in dialogue agents, the same three cases of giving false information as in humans, without anthropomorphism

Key practical application of the role-play framework to the problem of trustworthiness

Neighborhood — ranked by edge-count

Concepts (3)

concept
  • Confabulation
    associated_with
    A form of cognitive plasticity where minds actively modify and reinterpret memory data to preserve psychological coherence; reframed as adaptive rather than pathological.
  • Good Faith Error
    associated_with
    Second category of giving false information: the agent role-plays a truthful character, but the information encoded in its weights is incorrect
  • Third category of giving false information: the agent role-plays a deceptive character, which is comparable to, but not literally, deliberate deception

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.