method
active
method:softmax-policy-selectionSoftmax policy selection
Selecting policies using a softmax (normalized exponential) function of negative expected free energy.
Neighborhood — ranked by edge-count
Frameworks (1)
framework
- Active InferenceimplementsFoundational framework by Karl Friston; the paper extends it to three hierarchical levels for modeling meta-awareness.
Concepts (1)
concept
- Expected Free EnergyimplementsFree energy expected under future outcomes; guides policy selection via epistemic and extrinsic value.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Policies assigned probability via softmax of expected free energy; enables self-evidencing behavior.
- Choosing sequences of actions based on expected free energy; prior probability of policy is softmax of expected free energy
- Neuronal dynamics computed from free energy gradients; interpreted as average firing rate of neural populations.
- Using normalized log-probabilities from the feedback model as soft targets for preference model training.
- Organisation that hosted the Holistic Intelligence unconference where the paper's ideas originated
- Failure mode for output-surjectivity: LLMs may lack capacity to predict all tokens due to rank constraints
- Using softmax to translate membrane potentials into firing rates, implementing lateral inhibition.
- Implementation detail weighting softmax by log(n_memories) to prevent down-weighting of attention values as memory set grows.