method
active
method:reward-function-categoriesReward Function Categories
Seven categories determined by which components of f[h] are activated: Objective only, Expect only, Compare only, and combinations
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- In RL, a scalar signal from the environment that defines the agent's goal; in active inference, reward is just another observation with associated preference.
- The seven categories (Objective only, Expect only, Compare only, and four combinations) structuring the experiment
- Framework from Singh, Lewis, and Barto 2009 used to select best-performing reward functions via grid search
- Motivates active inference's solution: learning prior preferences from interaction rather than external specification.
- Subjective reward signal from Dubey et al. 2022 balancing objective reward, expectations, and comparisons; extended in this paper
- The increase in reward during training, whose dynamics align with those of causal emergence in successful agents.
- The claim in RL that any goal can be expressed as maximizing the expected cumulative sum of a scalar reward signal.
- Pragmatic or extrinsic value component of expected free energy; preference maximization.