finding
active
finding:enhancement-steering-consistently-underperforms-compared-to-directly-providing-explicit-reflection-instructions-across-all-tested-conditions

Enhancement steering consistently underperforms compared to directly providing explicit reflection instructions across all tested conditions

Shows that activation steering does not fully replicate mechanisms triggered by explicit prompting.

Source paper

extracted_from
Unveiling the Latent Directions of Reflection in Large Language Models
(2025) · Chang, Fu-Chieh · Lee, Yu-Ting · Wu, Pei-Yuan

Neighborhood — ranked by edge-count

Claims (1)

claim

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.