method
active
method:logistic-regression-correctness-probe

Logistic regression correctness probe

A logistic regression classifier trained on the GSM8K training set to predict answer correctness from projection features computed along the reflection direction.
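A minimal sketch of how such a probe could be set up, under stated assumptions: the record does not say how the projection features are computed, which layers or token positions are used, or what training settings apply. Here each example is assumed to contribute a single scalar feature (the projection of a hidden-state vector onto a unit-normalized direction), the data is synthetic rather than GSM8K, and the names `project`, `fit_logistic`, `predict_proba`, and `fake_hidden` are hypothetical helpers introduced only for illustration.

```python
import math
import random


def project(hidden, direction):
    """Scalar projection of a hidden-state vector onto the unit-normalized direction."""
    norm = math.sqrt(sum(d * d for d in direction))
    return sum(h * d for h, d in zip(hidden, direction)) / norm


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict_proba(x, w, b):
    """Predicted probability that the answer is correct, given feature vector x."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)


def fit_logistic(features, labels, lr=0.1, epochs=500):
    """Fit weights and bias by batch gradient descent on the mean log-loss."""
    n, n_feat = len(features), len(features[0])
    w, b = [0.0] * n_feat, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * n_feat, 0.0
        for x, y in zip(features, labels):
            err = predict_proba(x, w, b) - y  # gradient of log-loss w.r.t. the logit
            for j in range(n_feat):
                grad_w[j] += err * x[j]
            grad_b += err
        w = [wi - lr * g / n for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


# Synthetic stand-in data (assumption): "correct" examples are shifted along the
# direction, "incorrect" ones against it. Real hidden states would come from the model.
random.seed(0)
dim = 8
direction = [random.gauss(0, 1) for _ in range(dim)]


def fake_hidden(correct):
    shift = 1.0 if correct else -1.0
    return [random.gauss(0, 1) + shift * d for d in direction]


labels = [i % 2 for i in range(200)]  # 1 = correct answer, 0 = incorrect
features = [[project(fake_hidden(y), direction)] for y in labels]

w, b = fit_logistic(features, labels)
accuracy = sum(
    (predict_proba(x, w, b) >= 0.5) == bool(y) for x, y in zip(features, labels)
) / len(labels)
print(f"weight={w[0]:.3f} bias={b:.3f} train_accuracy={accuracy:.3f}")
```

In practice a library implementation such as scikit-learn's `LogisticRegression` would likely replace the hand-written optimizer, and accuracy would be reported on held-out problems rather than on the training examples used here.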

Neighborhood — ranked by edge-count

Frameworks (1)

framework
  • The proposed framework for probing and steering self-reflection behavior in reasoning LLMs via representation engineering

Methods (2)

method

Hypotheses (1)

hypothesis

Claims (1)

claim

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.