finding
active
finding:intention-checking-peaks-at-1-2-depth-in-transformersintention checking peaks at ~1/2 depth in transformers
Lindsey (2026) found that intention checking accuracy peaks around half the network depth.
Source paper
extracted_fromNeighborhood — ranked by edge-count
Claims (1)
claim
- Interpretive claim connecting exponential path combinatorics to Lindsey's layer-dependent findings.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Lindsey (2026) found that thought detection accuracy is highest around two-thirds of the network depth.
- Thought detection peaks at ~2/3 layer depth; intention checking peaks at ~1/2 layer depth.finding0.847Lindsey (2026) differential layer performance explained by Janus's path combinatorics — different tasks use different path distributions.
- Interpretive claim from attention head attribution analysis in appendix
- Truth-related directions reliably emerge at 60–75% of normalized layer depth in Qwen and Gemma modelsfinding0.735Experiment 1 finding localizing where truth can be causally mediated
- Triggered Reflection with 'Alternatively' achieves accuracy .684 on gsm8k_adv for Gemma3-4B-ITfinding0.733Highest single-instruction accuracy result in the paper.
- Janus's claim linking path redundancy to interferometric phenomenology.
- The optimal layer for the prefill introspection differs from the optimal layer for detecting injected thoughts.
- Core claim for two-layer models; composition creates qualitatively more powerful in-context learning