claim
active
claim:truth-directions-fail-to-generalize-to-harder-tasks-f3-f5-regardless-of-prompt-template-because-activations-remain-highly-entangled

Truth directions fail to generalize to harder tasks (F3-F5) regardless of prompt template because activations remain highly entangled.

Establishes task difficulty as a hard limit that instructions cannot overcome.

Source paper

extracted_from
Testing the Limits of Truth Directions in LLMs
(2026) · Angelos Poulis · Mark Crovella · Evimaria Terzi

Neighborhood — ranked by edge-count

Findings (2)

finding

Claims (1)

claim

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.