claim
active
claim:the-middle-layer-residual-stream-features-are-causally-implicated-in-multi-step-reasoningThe middle layer residual stream features are causally implicated in multi-step reasoning.
Features for Kobe Bryant, California, Lakers participate in computing the capital answer.
Source paper
extracted_fromRelated by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Methodological critique of prior work that fixed a single layer for truth probing.
- Architectural observation enabling the entire mathematical framework; the residual stream is purely a sum of linear projections
- Supported by the geometric transition visible in cosine similarity heatmaps for F0-F3.
- We hypothesize earlier-layer interventions allow more downstream computation to process and potentially correct the perturbationhypothesis0.763Post-hoc explanation for why steering at layer 33 rather than layer 50 produced better ESR behavior in Llama-3.3-70B
- Truth directions emerge in earlier layers for factual tasks and later layers for arithmetic tasks.claim0.763Core empirical claim about the layer-dependence of truth direction emergence as a function of task type.
- Vision of the emerging paradigm shift in society.
- Shows that Burger et al.'s layer choice corresponds to a transitional phase, not a universal property.
- Interpretive claim about the locus of reflection in transformer architecture.