claim
active
claim:cot-improves-in-distribution-but-may-harm-out-of-distribution-generalizationCoT improves in-distribution but may harm out-of-distribution generalization
Interpretation of scope generalization results
Source paper
extracted_from(2025) · Edward Yi Chang · Kaya, Zeyneb N. · Ethan Chang
Neighborhood — ranked by edge-count
Findings (1)
finding
- Scope generalization results after LoRA+CoT fine-tuning
Communities (2)
community
- CoT effects on generalization, multimodal QA accuracy, and AI safety alignment training.
- Empirical studies showing CoT reasoning improves ID performance while harming OOD generalization, with probability calibration as a mitigation strategy.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- CoT increases dr for OOD operands.
- Machine learning generalization when training and test distributions differ; linked to causal invariance.
- E2 finding showing CoT's limited benefit for OOD transfer, consistent with larger dr out of scope
- EI and normalized EI could serve as a unified metric for out-of-distribution generalization.claim0.773Conjecture that maximizing EI yields causal representations invariant to distribution shifts.
- The capacity of a probe trained on one true/false dataset to accurately classify statements from topically and structurally different datasets
- Section 4.3 describes clamping at 40-60 led to better behavior than clamping at 20-80.
- A promising property for interpretability analysis off-distribution.
- Open question implied by the claim that the method could generalize; empirical validation beyond genomics is not provided in this paper.