claim
active
claim:discovered-truth-directions-are-highly-specific-and-do-not-interfere-with-general-instruction-following-behavior

Discovered truth directions are highly specific and do not interfere with general instruction-following behavior

Interpretation of KL divergence retention results

Source paper

extracted_from
From Directions to Cones: Exploring Multidimensional Representations of Propositional Facts in LLMs
(2025) · Kevin Shengyang Yu · Vaidehi Bulusu · Oscar Yasunaga · Clayton Lau +4

Neighborhood — ranked by edge-count

Findings (1)

finding

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.