claim
active
claim:contrasting-no-reflection-with-triggered-reflection-0-2-provides-a-stronger-reflection-signal-than-contrasting-intrinsic-with-triggered-reflection-1-2Contrasting No Reflection with Triggered Reflection (µ(0→2)) provides a stronger reflection signal than contrasting Intrinsic with Triggered Reflection (µ(1→2)).
Empirical interpretation of which reference baseline yields more useful steering vectors.
Source paper
extracted_from(2025) · Chang, Fu-Chieh · Lee, Yu-Ting · Wu, Pei-Yuan
Neighborhood — ranked by edge-count
Findings (1)
finding
- Shows that contrasting No Reflection with Triggered Reflection provides a stronger signal than Intrinsic vs Triggered.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Triggered Reflection with 'Alternatively' achieves accuracy .684 on gsm8k_adv for Gemma3-4B-ITfinding0.805Highest single-instruction accuracy result in the paper.
- Central interpretive claim of the paper, supported by steering vector experiments.
- Mechanistic analog connecting Lindsey's layer-localized findings to the scorer's enacted/described distinction
- Core empirical result validating the three-level reflection framework on code reasoning.
- Reflection-inducing directions emerge more clearly in higher layers (ℓ>5) for both models and datasetsfinding0.762Empirical observation about which network layers encode reflection-relevant information.
- Author's interpretation of the negative correlation between reflection rate and accuracy observed in Fig. 5
- Applied dual-use conclusion drawn from the paper's findings.
- µ is an applicative homomorphism: µ(pure a) = pure a and µ(imf <*> imx) = µ imf <*> µ imx.claim0.755Result for Image applicative specification.