finding
active
finding:in-llama-2-13b-larger-than-and-smaller-than-separate-along-antipodal-directions-in-pca-in-llama-2-70b-they-align-along-a-common-directionIn LLaMA-2-13B, larger_than and smaller_than separate along antipodal directions in PCA; in LLaMA-2-70B they align along a common direction
Scale-dependent alignment result demonstrating how more abstract truth representations emerge with scale
Source paper
extracted_from(2023) · Samuel Marks · Max Tegmark
Neighborhood — ranked by edge-count
Claims (2)
claim
- Interpretive claim connecting scale to abstraction level in LLM representations
- Scale-dependent structural finding from PCA visualizations in §4
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Shows absence of abstract truth representations in smallest model, supporting scale-dependent emergence claim
- Layer-by-layer evolution of truth direction alignment, supporting hierarchical abstraction hypothesis
- Demonstrates that small models represent surface features rather than abstract truth
- Case of misalignment showing that the truth direction is not always shared between a dataset and its negation in smaller models
- Layer-wise emergence pattern supporting hierarchical development hypothesis
- Striking cross-domain generalization result supporting the claim that larger models represent abstract truth
- Primary visual evidence for linear truth representations in large LLMs
- Key empirical result showing that optimizing for behavioral outputs and fitting representation geometry produce the same path in activation space.