claim
active
claim:reflective-reasoning-requires-late-stage-integration-of-semantic-and-reasoning-signals-hence-reflection-related-directions-emerge-more-clearly-in-higher-network-layersReflective reasoning requires late-stage integration of semantic and reasoning signals, hence reflection-related directions emerge more clearly in higher network layers.
Interpretive claim about the locus of reflection in transformer architecture.
Source paper
extracted_from(2025) · Chang, Fu-Chieh · Lee, Yu-Ting · Wu, Pei-Yuan
Neighborhood — ranked by edge-count
Findings (1)
finding
- Empirical observation about which network layers encode reflection-relevant information.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Core claim of ReflCtrl that a single direction captures and controls reflection
- Theoretical limitation identified by the authors distinguishing reflection from stylistic tasks.
- Key limitation and open question about experimental scope.
- Central interpretive claim of the paper, supported by steering vector experiments.
- Core applied contribution claim, supported by top-k accuracy comparisons.
- Truth directions emerge in earlier layers for factual tasks and later layers for arithmetic tasks.claim0.786Core empirical claim about the layer-dependence of truth direction emergence as a function of task type.
- First central research question motivating ReflCtrl investigation
- Supported by the instruction discovery experiments comparing steering vs. embedding baselines.