concept
active
concept:emergence-of-abstract-representations-with-scale
Emergence of Abstract Representations with Scale
The observation that larger LLMs develop more general, abstract linear representations (e.g., of truth across diverse topics) than smaller models do.
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; they are candidates for new edges or unrecognized duplicates.
- Interpretation of weaker PCA separation and lower ASR in smaller models
- The central question of whether representational geometry implies corresponding computational structure
- Interpretive claim connecting scale to abstraction level in LLM representations
- Alignment faking appears almost exclusively in models at scale of Claude 3 Opus and Claude 3.5 Sonnet
- Causal emergence identification tasks can be understood as causal representation learning tasks. · claim · 0.769 · Authors propose a conceptual mapping between CE identification and CRL.
- Bedau's tripartite classification: nominal (pattern), weak (computationally irreducible), strong (irreducible downward causation).