claim
active
claim:antipodal-alignment-between-related-datasets-e-g-larger-than-and-smaller-than-in-smaller-models-resolves-to-common-direction-alignment-in-larger-modelsAntipodal alignment between related datasets (e.g., larger_than and smaller_than) in smaller models resolves to common-direction alignment in larger models
Scale-dependent structural finding from PCA visualizations in §4
Source paper
extracted_from(2023) · Samuel Marks · Max Tegmark
Neighborhood — ranked by edge-count
Findings (2)
finding
- Case of misalignment showing that the truth direction is not always shared between a dataset and its negation in smaller models
- Scale-dependent alignment result demonstrating how more abstract truth representations emerge with scale
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Layer-by-layer evolution of truth direction alignment, supporting hierarchical abstraction hypothesis
- On CIFAR-10, larger models exhibit greater alignment with each other compared to smaller onesfinding0.781Kornblith et al. / Krizhevsky finding replicated in paper discussion
- A mapping assigning to each high-level variable a set of low-level variables and a function from low-level to high-level values.
- The case where two datasets (e.g., larger_than and smaller_than) separate along opposite directions in PCA, indicating a shared feature with opposite sign
- Bigger models are more likely to converge to a shared representation than smaller modelshypothesis0.761Selective pressure toward convergence via model capacity
- Key geometry-to-behavior bridge finding in E3; robust to pooling choice, cosine vs. L2, and frozen external encoder
- Claims that alignment score is a proxy for general capability