finding
active
finding:in-llama-2-13b-salient-linear-structure-in-the-top-pcs-rapidly-emerges-in-early-middle-layers-with-this-emergence-occurring-later-for-conjunctive-statements-than-simple-statementsIn LLaMA-2-13B, salient linear structure in the top PCs rapidly emerges in early-middle layers, with this emergence occurring later for conjunctive statements than simple statements
Layer-wise emergence pattern supporting hierarchical development hypothesis
Source paper
extracted_from(2023) · Samuel Marks · Max Tegmark
Neighborhood — ranked by edge-count
Hypotheses (1)
hypothesis
- Stated explicitly in App. C to explain why linear structure emerges later for conjunctive statements
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Layer-wise PCA analysis shows hierarchical development of truth representations across forward pass
- Primary visual evidence for linear truth representations in large LLMs
- Interpretation of E3 layer-wise results; motivates targeted UCCT interventions at layers 8-12
- Localizes truth representations to specific hidden states, motivating the rest of the analysis
- Shows absence of abstract truth representations in smallest model, supporting scale-dependent emergence claim
- Math and code tasks show strongest mid-layer anchoring on LLaMA (S ≈ −1.65 at layers 8-12)finding0.780Task-specific E3 finding showing compositional reasoning requires deeper processing
- Scale-dependent alignment result demonstrating how more abstract truth representations emerge with scale
- Hypothesized intermediate feature explaining antipodal alignment between cities and neg_cities in early-middle layers