method
active
method:value-iterationValue Iteration
A dynamic programming method for computing optimal value functions and policies in known MDPs.
Neighborhood — ranked by edge-count
Papers (1)
paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Probability of sensory input expected by an agent, aligning value maximization with surprise minimization.
- Meaning that arises from relations within the graphical system, not inherent in elements.
- Negative of value, equated with free-energy and surprise.
- The reappearance of similar elements, essential for unity and order in a living floor or ceiling.
- Spreadsheet-like rule defining how a rectangle's or object's value is computed; enables data-driven behavior across all Playground tools.
- Second training stage: samples responses, filters for type hints, and fine-tunes on filtered responses across four rounds to reinforce evaluation behavior.
- Expected information gain about hidden states; drives curiosity and novelty-seeking; mutual information term in expected free energy.
- Economic concept related to epistemic value in expected free energy; information needed to realize rewards