finding
active
finding:math-and-code-tasks-show-strongest-mid-layer-anchoring-on-llama-s-1-65-at-layers-8-12Math and code tasks show strongest mid-layer anchoring on LLaMA (S ≈ −1.65 at layers 8-12)
Task-specific E3 finding showing compositional reasoning requires deeper processing
Source paper
extracted_from(2025) · Edward Yi Chang · Kaya, Zeyneb N. · Ethan Chang
Neighborhood — ranked by edge-count
Claims (3)
claim
- Main interpretation of E3.
- Claim that geometry-to-behavior correlates exist
- Task-specific interpretation of E3 anchoring pattern differences
Communities (3)
community
- Few-shot anchoring & latent structuremembers_ofHow minimal examples disambiguate and recruit latent arithmetic/reasoning interpretations in LLMs
- Silhouette-based metrics (Sbmax, AUSN) across LLM layers predict task accuracy and few-shot thresholds.
- Studies how internal layer-wise geometric properties (anchoring, clustering trajectories, geometry summaries) peak in middle layers and predict downstream task performance across LLMs and shallow networks.
Concepts (2)
concept
- Gemma-3-4B-itassociated_withBackbone model used in E3 robustness overlay.
- Phi-4associated_withBackbone model used in E3 robustness overlay.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- E3 result establishing the Goldilocks zone at mid-layers for LLaMA architecture
- Interpretation of E3 layer-wise results; motivates targeted UCCT interventions at layers 8-12
- LLaMA-3.1-8B: Sbmax = -1.896 ± 0.211, AUSN = -2.119 ± 0.198, peak layer ℓ* = 10 (median)finding0.816Seed-pooled geometry-only statistics (per-dev z units).
- Connects this study's results to Schrimpf et al. 2021 and Caucheteux et al. 2022/2023 findings on brain-LLM alignment.
- Core E3 finding validating S as a predictor of anchoring effectiveness
- E3 finding suggesting pattern matching requires less intensive processing than compositional reasoning
- One of the most promising cases; approximately corresponds to the 2/3 layer of LLaMA3.1-8B.
- Optimal activation capping layers for Llama 3.3 70B are layers 56-71 (out of 80) at 25th percentile capfinding0.802Specific implementation finding for Llama capping parameters