book
active
book:sutton-and-barto-2018

Sutton and Barto 2018

Standard RL textbook cited for traditional reward function optimization

Extracted from this book

Claims (8)

Findings (8)

Hypotheses (1)

Neighborhood — ranked by edge-count