finding
active
finding:the-ask-correct-template-delays-truth-direction-emergence-for-f3-and-reduces-performance-for-f4-f5-compared-to-no-promptThe ask-correct template delays truth direction emergence for F3 and reduces performance for F4-F5 compared to no-prompt.
Shows instruction effects extend to harder factual tasks.
Source paper
extracted_from(2026) · Angelos Poulis · Mark Crovella · Evimaria Terzi
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Establishes task difficulty as a hard limit that instructions cannot overcome.
- Finding that explicit correctness framing partially aligns truth directions across task families.
- Establishes F3-F5 as a hard generalization boundary that instructions cannot overcome.
- Key improvement in cross-task generalization enabled by explicit instruction framing.
- Specific question motivating the cross-template generalization experiment in Section 5.2.
- Contrasts with harder tasks that are sensitive to prompt variations.
- Does instructing the model to assess correctness affect the geometry of truth directions?question0.764One of the three guiding research questions of the paper.
- From the cross-task generalization heatmaps in Appendix B.3.3.