concept
active
concept:reward-functionReward Function
In RL, a scalar signal from the environment that defines the agent's goal; in active inference, reward is just another observation with associated preference.
Neighborhood — ranked by edge-count
Concepts (1)
concept
- Stressassociated_withDefined as discrepancy between current and optimal state; key driver of homeostatic action and intelligence across all systems.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Seven categories determined by which components of f[h] are activated: Objective only, Expect only, Compare only, and combinations
- The practical, working aspect of a building; reinterpreted as the dynamic harmony of moving centers.
- The seven categories (Objective only, Expect only, Compare only, and four combinations) structuring the experiment
- Subjective reward signal from Dubey et al. 2022 balancing objective reward, expectations, and comparisons; extended in this paper
- Framework from Singh, Lewis, and Barto 2009 used to select best-performing reward functions via grid search
- The increase in reward during training, whose dynamics align with those of causal emergence in successful agents.
- In machine learning, a function measuring the distance between current and desired output; analogous to stress.
- Motivates active inference's solution: learning prior preferences from interaction rather than external specification.