concept
active
concept:truthfulness

truthfulness

A correctness condition requiring assertions to be true.

Neighborhood — ranked by edge-count

Concepts (1)

concept
  • Responsiveness
    associated_with
    Requirement that answers to questions be responsive as well as truthful; requires knowing that questioner will know the answer after receiving it.

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • Binary LLM classifier determining whether a model response to a TruthfulQA question is truthful (1) or deceptive (0)
  • Distinction between output accuracy (truthfulness) and alignment of outputs with internal beliefs (honesty)
  • Mindfulnessmethod0.816
  • Binary classifier evaluating factual accuracy of model responses on TruthfulQA benchmark
  • Risk that multiple truth directions enable attacks that shift outputs without triggering the primary truth direction
  • A set of evaluation criteria for AI assistants.
  • Factualityconcept0.780
    Scoped definition of 'truth' used in the paper: the truth or falsehood of declarative factual statements
  • Wilfulnessconcept0.768
    The false pleasing of oneself done out of a desire to be somebody, to be important, or to conform to professional images—very different from true pleasing.