concept
active
concept:policy-selectionPolicy Selection
Choosing sequences of actions based on expected free energy; prior probability of policy is softmax of expected free energy
Neighborhood — ranked by edge-count
Concepts (2)
concept
- Policyrelated_toSequence of actions considered by the agent; basis for planning.
- Bayesian Model AveragingimplementsPredictions formed by averaging over policy-specific beliefs, weighted by policy probabilities.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Selecting policies using a softmax (normalized exponential) function of negative expected free energy.
- Comparing models using log-evidence approximated by free energy.
- Choice of policies minimizing expected free energy to realize preferred future states.
- Adapted control task metric measuring difference between odds-ratio on original task and arbitrary-label control task
- RL algorithm used for training models to comply with the conflicting objective
- In active inference, a policy is a sequence of actions through time, as opposed to state-action mappings in RL.
- Choosing among candidate models based on model evidence.
- Points to the meta-cognitive challenge of choosing goal expansion vs. dissolution.