concept
active
concept:seven-reward-function-groupsSeven Reward Function Groups
The seven categories (Objective only, Expect only, Compare only, and four combinations) structuring the experiment
Neighborhood — ranked by edge-count
Papers (1)
paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Seven categories determined by which components of f[h] are activated: Objective only, Expect only, Compare only, and combinations
- In RL, a scalar signal from the environment that defines the agent's goal; in active inference, reward is just another observation with associated preference.
- Framework from Singh, Lewis, and Barto 2009 used to select best-performing reward functions via grid search
- Grid search covers 312,130 subjective reward functions per environment after removing duplicatesfinding0.723Scale of the hyperparameter search establishing thoroughness of optimization
- Classification of one-dimensional periodic patterns into 7 symmetry groups using international notation pxyz
- Mathematical classification system for one-dimensional periodic patterns using group theory; core framework for the analysis.
- Subjective reward signal from Dubey et al. 2022 balancing objective reward, expectations, and comparisons; extended in this paper
- The claim in RL that any goal can be expressed as maximizing the expected cumulative sum of a scalar reward signal.