thinker
active
thinker:john-schulman
John Schulman
Cited for scaling laws for reward model overoptimization (2022).
Authored: 0
Introduces: 1
Studies: 0
Affiliations: 0
Cited by: 0
Originates (1)
Recent mentions (3)
- papers-typedkim-2026-active-inference.md
- papers-typedgreenblatt-2024-alignment.md
- papers-typedyuntao-2022-cat-s.md