claim
active
claim:the-ask-arith-prompt-shows-weaker-generalization-to-factual-tasks-compared-to-other-explicit-prompts-suggesting-a-specialized-arithmetic-prompt-does-not-create-a-unified-truth-direction-across-task-families

The ask-arith prompt generalizes more weakly to factual tasks than the other explicit prompts do, suggesting that a specialized arithmetic prompt does not create a unified truth direction across task families.

Evidence: the cross-task generalization heatmaps in Appendix B.3.3 of the source paper.
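For readers unfamiliar with cross-task generalization heatmaps, the sketch below shows one common way such a matrix can be computed: fit a linear "truth direction" on one task, then score it on every other task. It is an illustration only and not the paper's pipeline. The difference-of-means probe, the synthetic activations, the task names, and every function name here are assumptions made for the example.

```python
import numpy as np

def make_task(direction, n=400, noise=1.0, seed=0):
    """Synthetic stand-in for model activations (hypothetical data, not from the paper).
    True and false statements are shifted in opposite directions along `direction`."""
    rng = np.random.default_rng(seed)
    labels = rng.integers(0, 2, size=n)          # 1 = true statement, 0 = false
    signs = 2 * labels - 1                       # map {0, 1} to {-1, +1}
    x = rng.normal(scale=noise, size=(n, direction.shape[0]))
    x += 3.0 * signs[:, None] * direction
    return x, labels

def fit_direction(x, y):
    """Difference-of-means probe: w points from the false mean to the true mean."""
    mu_true = x[y == 1].mean(axis=0)
    mu_false = x[y == 0].mean(axis=0)
    w = mu_true - mu_false
    b = -0.5 * w @ (mu_true + mu_false)          # threshold at the midpoint of the means
    return w, b

def accuracy(w, b, x, y):
    pred = (x @ w + b > 0).astype(int)
    return float((pred == y).mean())

def generalization_matrix(tasks):
    """Entry [i, j] is the accuracy of the probe fit on task i, evaluated on task j.
    Diagonal entries are in-sample here; a real study would use a held-out split."""
    names = list(tasks)
    m = np.zeros((len(names), len(names)))
    for i, src in enumerate(names):
        w, b = fit_direction(*tasks[src])
        for j, tgt in enumerate(names):
            m[i, j] = accuracy(w, b, *tasks[tgt])
    return names, m

# Assumed toy setup: two factual tasks share a direction, arithmetic uses an orthogonal one.
dim = 16
e0, e1 = np.eye(dim)[0], np.eye(dim)[1]
tasks = {
    "factual_a": make_task(e0, seed=1),
    "factual_b": make_task(e0, seed=2),
    "arithmetic": make_task(e1, seed=3),
}
names, m = generalization_matrix(tasks)
```

In this toy setup the off-diagonal entries between the arithmetic task and the factual tasks sit near chance, which is the kind of pattern the claim describes: a direction fit under one task family need not transfer to another.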

Source paper

extracted_from
Testing the Limits of Truth Directions in LLMs
(2026) · Angelos Poulis · Mark Crovella · Evimaria Terzi
