concept
active
concept:state-action-policiesState-Action Policies
In reinforcement learning, a policy maps states to actions, specifying behavior at each state.
Neighborhood — ranked by edge-count
Concepts (1)
concept
- Reinforcement learning (RL)associated_withMachine learning paradigm where agents learn to maximize cumulative reward through interaction.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Sequence of actions considered by the agent; basis for planning.
- Changing configuration to sample environment differently; minimizes free energy.
- In active inference, a policy is a sequence of actions through time, as opposed to state-action mappings in RL.
- Choosing sequences of actions based on expected free energy; prior probability of policy is softmax of expected free energy
- Coordinated behavior of multiple components required to achieve non-linearly separable outcomes.
- Choice of policies minimizing expected free energy to realize preferred future states.
- Dual interpretation of features: in addition to responding to inputs, features also act to increase probability of specific output tokens