hypothesis
active
hypothesis:reasoning-llms-trigger-reflection-when-their-internal-uncertainty-is-highReasoning LLMs trigger reflection when their internal uncertainty is high
Core hypothesis linking internal uncertainty to self-reflection behavior, tested via probing experiments
Source paper
extracted_from(2025) · Ge Yan · Sun, Chung-En · Tsui-Wei · Weng
Neighborhood — ranked by edge-count
Findings (1)
finding
- Supports claim that uncertainty is encoded in reflection direction
Concepts (2)
concept
- Self-reflectionassociated_withThe ability of reasoning LLMs to review and revise previous reasoning steps during inference
- Internal uncertaintyassociated_withThe model's internal representation of uncertainty hypothesized to trigger self-reflection
Questions (1)
question
- First central research question motivating ReflCtrl investigation
Methods (1)
method
- Logistic regression trained on GSM8k training set to predict answer correctness from projection features along reflection direction
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Open question motivating the entire paper; identified as not yet well understood
- The underlying mechanism of self-reflection in reasoning LLMs is not yet well understoodquestion0.853Broad gap motivating the entire paper
- Core claim of ReflCtrl that a single direction captures and controls reflection
- The core phenomenon studied: the ability of LLMs to evaluate and revise their own reasoning.
- Empirical observation motivating the need to control reflection for inference efficiency
- Binder et al. finding cited as evidence that LLMs possess introspective capacity analogous to mindfulness
- Establishes that the observed linear structure is not merely a representation of text probability