thinker
active
thinker:john-schulman

John Schulman

Cited for scaling laws for reward model overoptimization (2022).

Authored
0
Introduces
1
Studies
0
Affiliations
0
Cited by
0

More papers — OpenAlex / S2

Originates (1)

Recent mentions (3)