concept
active
concept:difference-in-means-direction

Difference-in-Means Direction

Vector from mean of false representations to mean of true representations; core of mass-mean probing

Neighborhood — ranked by edge-count

Frameworks (1)

framework
  • Introduced in this paper: an optimization-free probing technique using difference-in-means direction with optional covariance correction

Concepts (1)

concept
  • Truth Direction
    associated_with
    A hypothesized direction in LLM activation space that encodes the truth or falsehood of factual statements

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.