finding
active
Factual tasks F0-F3 reach near-perfect AUROC in early-to-mid layers of Llama-3.1-8B; arithmetic tasks A1-A3 emerge much later; counting tasks F4-F5 also emerge late, similar to arithmetic.
Core empirical finding about layer-dependent truth direction emergence across task types.
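To make the notion of layer-dependent emergence concrete, here is a minimal sketch of how a per-layer AUROC curve could be computed and summarized. It is not the source paper's method. It assumes a difference-of-means probe as the truth direction, a hypothetical 0.95 cutoff for "near-perfect", and synthetic activations rather than real Llama-3.1-8B hidden states. All function names are illustrative.

```python
import random


def auroc(scores, labels):
    """Chance that a random true statement outscores a random false one (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one example of each class")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def mean_vector(rows):
    """Coordinate-wise mean of a non-empty list of equal-length vectors."""
    dim = len(rows[0])
    return [sum(r[i] for r in rows) / len(rows) for i in range(dim)]


def truth_direction(acts, labels):
    """Assumed probe: mean activation of true statements minus that of false ones."""
    mu_true = mean_vector([a for a, y in zip(acts, labels) if y == 1])
    mu_false = mean_vector([a for a, y in zip(acts, labels) if y == 0])
    return [t - f for t, f in zip(mu_true, mu_false)]


def project(acts, direction):
    """Dot product of each activation vector with the direction."""
    return [sum(a_i * d_i for a_i, d_i in zip(a, direction)) for a in acts]


def layerwise_auroc(train_by_layer, train_labels, test_by_layer, test_labels):
    """Fit a direction per layer on train data and score it on held-out data."""
    results = []
    for train_acts, test_acts in zip(train_by_layer, test_by_layer):
        direction = truth_direction(train_acts, train_labels)
        results.append(auroc(project(test_acts, direction), test_labels))
    return results


def emergence_layer(aurocs, threshold=0.95):
    """First layer index whose AUROC meets the (hypothetical) threshold, else None."""
    for index, value in enumerate(aurocs):
        if value >= threshold:
            return index
    return None


def synthetic_layers(n_layers, n_examples, dim, onset, rng):
    """Toy activations: the label signal appears in coordinate 0 from layer `onset` on."""
    labels = [i % 2 for i in range(n_examples)]
    layers = []
    for layer in range(n_layers):
        shift = 4.0 if layer >= onset else 0.0
        layers.append([
            [rng.gauss(0.0, 1.0) + (shift if (y == 1 and j == 0) else 0.0)
             for j in range(dim)]
            for y in labels
        ])
    return layers, labels


if __name__ == "__main__":
    rng = random.Random(0)
    # Two toy "task types": one whose signal appears early, one whose signal appears late.
    for name, onset in [("early-onset task", 2), ("late-onset task", 6)]:
        train, y_train = synthetic_layers(8, 100, 4, onset, rng)
        test, y_test = synthetic_layers(8, 100, 4, onset, rng)
        curve = layerwise_auroc(train, y_train, test, y_test)
        print(name, [round(v, 2) for v in curve], emergence_layer(curve))
```

With real data, the per-layer activation lists would be replaced by hidden states extracted from the model for each statement, and the emergence layers of different task families could then be compared directly.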
Source paper
extracted_from · (2026) · Angelos Poulis · Mark Crovella · Evimaria Terzi
Neighborhood — ranked by edge-count
Claims (1)
claim
- Truth directions emerge in earlier layers for factual tasks and later layers for arithmetic tasks. · supports · Core empirical claim about the layer-dependence of truth direction emergence as a function of task type.
Hypotheses (1)
hypothesis
- We hypothesize that LLMs represent correctness of arithmetic expressions differently from factual statements. · associated_with · Core working hypothesis motivating the factual vs. arithmetic task split in the experimental design.
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Shows that explicit instructions delay the emergence of truth directions in arithmetic tasks.
- Key improvement in cross-task generalization enabled by explicit instruction framing.
- Connects this study's results to Schrimpf et al. 2021 and Caucheteux et al. 2022/2023 findings on brain-LLM alignment.
- Contrasts with harder tasks that are sensitive to prompt variations.
- Establishes F3-F5 as a hard generalization boundary that instructions cannot overcome.
- The specific Fourier feature periods identified confirm base-10 rather than modular computation.
- One of the most promising cases; corresponds approximately to the layer at two-thirds depth in Llama-3.1-8B.
- Shows the instruction effect, while shifting geometry, may not produce consistent generalization effects across model families.