finding
active
finding:training-on-cities-neg-cities-improves-ood-generalization-especially-on-neg-sp-en-transTraining on cities+neg_cities improves OOD generalization, especially on neg_sp_en_trans
Training on statements and their negations mitigates non-truth feature interference in probe directions
Source paper
extracted_from(2023) · Samuel Marks · Max Tegmark
Neighborhood — ranked by edge-count
Claims (1)
claim
- Explains why cities+neg_cities and larger_than+smaller_than training sets yield better OOD accuracy
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Open question about scale-dependent asymmetry in training data effects
- Dissociation between classification accuracy and causal implication; training on opposites does not always help causally
- Proposed conjecture in §4.3.1.
- Layer-by-layer evolution of truth direction alignment, supporting hierarchical abstraction hypothesis
- RNN model recapitulating grid cells; related work category 4.
- Selective pressure toward convergence via task generality
- Yang et al. (2023) result linking EI maximization to robust generalization.
- What factors determine the generalisation of learned alignment maps beyond training data?question0.722Open question about the gap between Theorem 1's existence proof and practical learnability