finding
active
finding:within-family-factual-generalization-f0-f2-is-consistently-strong-across-all-models-and-prompt-settingsWithin-family factual generalization (F0-F2) is consistently strong across all models and prompt settings.
Establishes a reliable baseline for factual truth direction universality within simple factual recall.
Source paper
extracted_from(2026) · Angelos Poulis · Mark Crovella · Evimaria Terzi
Neighborhood — ranked by edge-count
Claims (1)
claim
- Central empirical conclusion of the paper about the fundamental limits of truth directions.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Shows the instruction effect, while shifting geometry, may not produce consistent generalization effects across model families.
- Contrasts with harder tasks that are sensitive to prompt variations.
- Establishes generalizability of the core difficulty-boundary finding across model families.
- Establishes F3-F5 as a hard generalization boundary that instructions cannot overcome.
- Establishes task difficulty as a hard limit that instructions cannot overcome.
- A controlled six-level hierarchy of factual tasks increasing in complexity from simple city-location recall to double-counting constraints.
- Finding that explicit correctness framing partially aligns truth directions across task families.
- Key improvement in cross-task generalization enabled by explicit instruction framing.