finding
active
finding:f3-trained-probes-achieve-auroc-0-6-on-f4-showing-generalization-breakdown-from-counting-over-2-to-5-cities

F3-trained probes achieve AUROC ~0.6 on F4, showing generalization breakdown from counting over 2 to 5 cities.

Demonstrates the sharp drop in factual truth generalization at the counting boundary.

Source paper

extracted_from
Testing the Limits of Truth Directions in LLMs
(2026) · Angelos Poulis · Mark Crovella · Evimaria Terzi

Neighborhood — ranked by edge-count

Claims (1)

claim

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.