claim
active
claim:training-probes-on-statements-and-their-opposites-improves-generalization-by-mitigating-non-truth-features-with-opposite-sign-correlations

Training probes on statements and their opposites improves generalization by mitigating non-truth features with opposite-sign correlations

Explains why cities+neg_cities and larger_than+smaller_than training sets yield better OOD accuracy

Neighborhood — ranked by edge-count

Findings (4)

finding

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.