finding
active
finding:llama-2-7b-representations-of-larger-than-smaller-than-cluster-by-surface-level-characteristics-such-as-presence-of-token-eightyLLaMA-2-7B representations of larger_than+smaller_than cluster by surface-level characteristics such as presence of token 'eighty'
Demonstrates that small models represent surface features rather than abstract truth
Source paper
extracted_from(2023) · Samuel Marks · Max Tegmark
Neighborhood — ranked by edge-count
Claims (1)
claim
- As LLMs scale, they develop increasingly general abstractions, with large models linearly representing abstract concepts like truth that capture shared properties of diverse inputsassociated_withsupportsInterpretive claim connecting scale to abstraction level in LLM representations
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Shows absence of abstract truth representations in smallest model, supporting scale-dependent emergence claim
- Empirical observation establishing that Llama's internal representations for days-of-week have circular geometric structure.
- Empirical observation establishing that Llama's behavior for days-of-week tasks has circular structure.
- Scale-dependent alignment result demonstrating how more abstract truth representations emerge with scale
- Contrasts with 7B and 13B which show consistent summarization behavior; may complicate localization at 70B scale
- Larger models linearly represent more general concepts including truth
- LLaMA-3.1-8B: Sbmax = -1.896 ± 0.211, AUSN = -2.119 ± 0.198, peak layer ℓ* = 10 (median)finding0.791Seed-pooled geometry-only statistics (per-dev z units).
- Primary visual evidence for linear truth representations in large LLMs