finding
active
finding:reflection-direction-features-achieve-auroc-0-772-vs-0-736-for-final-layer-baseline-on-deepseek-llama-8b-on-gsm8k-correctness-prediction

Reflection direction features achieve AUROC 0.772 vs. 0.736 for final layer baseline on deepseek-llama-8b on GSM8k correctness prediction

Supports claim that uncertainty is encoded in reflection direction

Source paper

extracted_from
ReflCtrl: Controlling LLM Reflection via Representation Engineering
(2025) · Ge Yan · Sun, Chung-En · Tsui-Wei · Weng

Neighborhood — ranked by edge-count

Claims (1)

claim

Hypotheses (1)

hypothesis

Questions (1)

question

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.