finding
active
finding:math-code-tasks-s-1-65-at-layers-8-12Math/code tasks S ≈ -1.65 at layers 8–12
Task-specific peak anchoring score for structured reasoning domains.
Source paper
extracted_from(2025) · Edward Yi Chang · Kaya, Zeyneb N. · Ethan Chang
Neighborhood — ranked by edge-count
Claims (1)
claim
- Qualitative characterization of optimal anchoring depth.
Communities (3)
community
- Few-shot anchoring & latent structuremembers_ofHow minimal examples disambiguate and recruit latent arithmetic/reasoning interpretations in LLMs
- Silhouette-based metrics (Sbmax, AUSN) across LLM layers predict task accuracy and few-shot thresholds.
- Studies how internal layer-wise geometric properties (anchoring, clustering trajectories, geometry summaries) peak in middle layers and predict downstream task performance across LLMs and shallow networks.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Task-specific interpretation of E3 anchoring pattern differences
- Layers where anchoring weakens systematically due to representational drift.
- Math and code tasks show strongest mid-layer anchoring on LLaMA (S ≈ −1.65 at layers 8-12)finding0.773Task-specific E3 finding showing compositional reasoning requires deeper processing
- A combinatorial argument that good sequences are astronomically rare, emphasizing the difficulty of discovery.
- Thought detection peaks at ~2/3 layer depth; intention checking peaks at ~1/2 layer depth.finding0.768Lindsey (2026) differential layer performance explained by Janus's path combinatorics — different tasks use different path distributions.
- Shows S predicts anchoring effectiveness.
- Core empirical finding about layer-dependent truth direction emergence across task types.
- Claim that geometry-to-behavior correlates exist