claim
active
claim:active-inference-agents-can-learn-their-own-reward-function-prior-preferences-by-interacting-with-the-environment-bypassing-the-need-for-an-explicit-reward-signalActive inference agents can learn their own reward function (prior preferences) by interacting with the environment, bypassing the need for an explicit reward signal.
Abstract and §3, preference learning section.
Source paper
extracted_from(2021) · Noor Sajid · Philip J. Ball · Thomas Parr · Karl J. Friston
Neighborhood — ranked by edge-count
Findings (3)
finding
- Empirical demonstration on FrozenLake; shows epistemic value drives exploration absent reward signal.
- Figure 5.4 and text.
- Table 1.
Communities (2)
community
- Active inference & agent ecologymembers_ofFree energy minimization, Markov blankets, trust gradients, and multi-agent rhythm/deferral frameworks
- Friston's framework unifying perception, action, and learning under variational free energy minimization.
Frameworks (1)
framework
- Active InferencesupportsFoundational framework by Karl Friston; the paper extends it to three hierarchical levels for modeling meta-awareness.
Questions (1)
question
- Question answered by the preference learning experiments.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- §2, summarizing information-seeking behavior.
- Abstract; central distinction.
- Abstract and §1, summarizing a key property.
- §3, after non-stationary results.
- Concise statement of the core hypothesis from Section 2.
- Core question addressed by the simulations when rewards are removed.
- Core claim of active inference stated in Section 2.