framework
active
framework:optimal-reward-frameworkOptimal Reward Framework
Framework from Singh, Lewis, and Barto 2009 used to select best-performing reward functions via grid search
Neighborhood — ranked by edge-count
Papers (1)
paper
Concepts (1)
concept
- Source of the optimal reward framework used to evaluate and select best reward functions
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- In RL, a scalar signal from the environment that defines the agent's goal; in active inference, reward is just another observation with associated preference.
- Seven categories determined by which components of f[h] are activated: Objective only, Expect only, Compare only, and combinations
- The claim in RL that any goal can be expressed as maximizing the expected cumulative sum of a scalar reward signal.
- The seven categories (Objective only, Expect only, Compare only, and four combinations) structuring the experiment
- The increase in reward during training, whose dynamics align with those of causal emergence in successful agents.
- Motivation claim positioning this paper against standard RL approaches
- 1984 Ashton-Tate integrated system with frames, FRED language, and overlapping windows; design reference for Playground's approach.