concept
active
concept:truth-direction-universality

Truth direction universality

The claim that truth directions are consistent and generalizable across layers, tasks, and prompt formats in LLMs.

Neighborhood — ranked by edge-count

Papers (1)

paper

Findings (3)

finding

Concepts (1)

concept
  • Truth Direction
    associated_withrelated_to
    A hypothesized direction in LLM activation space that encodes the truth or falsehood of factual statements

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.