concept
active
concept:out-of-distribution-probe-generalizationOut-of-Distribution Probe Generalization
The capacity of a probe trained on one true/false dataset to accurately classify statements from topically and structurally different datasets
Neighborhood — ranked by edge-count
Concepts (2)
concept
- Out-of-Distribution (OOD) Generalizationrelated_toMachine learning generalization when training and test distributions differ; linked to causal invariance.
- Probe Generalizationrelated_toThe ability of probes trained on one dataset to transfer accurately to topically and structurally different datasets
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- EI and normalized EI could serve as a unified metric for out-of-distribution generalization.claim0.782Conjecture that maximizing EI yields causal representations invariant to distribution shifts.
- Key methodological claim: MM probes are both competitive in accuracy and superior in causal influence
- Ability to apply learned solutions to novel circumstances.
- Ability to respond appropriately to novel situations based on past regularities; fundamental to learning and intelligence.
- Interpretation of scope generalization results
- Generalization from 2-digit to 3-4 digit arithmetic; limited by mismatch dr.
- Linear classifier approach applied to model activations to identify which training datapoints caused undesired behaviors in post-training.
- The ability to generalize across tasks; lacking in latent methods.