question
active
question:what-is-the-effect-of-model-instructions-on-truth-directionsWhat is the effect of model instructions on truth directions?
Research question motivating Section 5.
Source paper
extracted_from(2026) · Angelos Poulis · Mark Crovella · Evimaria Terzi
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Does instructing the model to assess correctness affect the geometry of truth directions?question0.847One of the three guiding research questions of the paper.
- Motivating hypothesis for Section 5's investigation of prompt template effects.
- Interpretation of KL divergence retention results
- Open question on generalization beyond Gemma and Qwen families
- Overarching conclusion summarizing the paper's contribution relative to prior universality claims.
- Specific question motivating the cross-template generalization experiment in Section 5.2.
- A hypothesized direction in LLM activation space that encodes the truth or falsehood of factual statements
- Safety implication derived from multi-dimensional truth structure finding