method

active

method:star-self-taught-reasoner

STaR (Self-Taught Reasoner)

A method for improving reasoning by self-training on rationales.

Neighborhood — ranked by edge-count

Artifacts (1)

artifact

Simulators (LessWrong post)
mentions
The paper being extracted.

Related by similarity (2)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Synthetic Self-Correction Fine-Tuningmethod0.656
Fine-tuning on Claude-generated self-correction examples with loss masking to induce ESR-like behavior
Raileanu et al. 2018 - Modeling Others Using Oneself in Multi-Agent Reinforcement Learningconcept0.651
Reference for Self-Other Modeling (SOM) framework, a related but less scalable approach to SOO