claim

active

claim:active-inference-agents-can-learn-their-own-reward-function-prior-preferences-by-interacting-with-the-environment-bypassing-the-need-for-an-explicit-reward-signal

Active inference agents can learn their own reward function (prior preferences) by interacting with the environment, bypassing the need for an explicit reward signal.

Abstract and §3, preference learning section.

Source paper

extracted_from

Active inference: demystified and compared

(2021) · Noor Sajid · Philip J. Ball · Thomas Parr · Karl J. Friston

Neighborhood — ranked by edge-count

Findings (3)

finding

Active inference agents engage in information-seeking behavior in reward-free FrozenLake environments, contrasting with Q-learning but similar to Bayesian RL.
supports
Empirical demonstration on FrozenLake; shows epistemic value drives exploration absent reward signal.
Active inference agent with learnable preferences developed a strict preference for goals (score +) when the Frisbee location was encountered first, becoming a goal-seeking agent.
supports
Figure 5.4 and text.
Active Inference null model (no prior preferences) achieved average score 50.03 [49.70, 50.35] in deterministic FrozenLake.
supports
Table 1.

Communities (2)

community

Active inference & agent ecology
members_of
Free energy minimization, Markov blankets, trust gradients, and multi-agent rhythm/deferral frameworks
Active inference & free energy minimization
members_of
Friston's framework unifying perception, action, and learning under variational free energy minimization.

Frameworks (1)

framework

Active Inference
supports
Foundational framework by Karl Friston; the paper extends it to three hierarchical levels for modeling meta-awareness.

Questions (1)

question

Can active inference agents learn their own prior preferences without explicit reward signals?
gates
Question answered by the preference learning experiments.

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Active inference agents can learn epistemic habits in the absence of extrinsic rewards through ambiguity minimization.claim0.890
§2, summarizing information-seeking behavior.
In active inference, reward can simply be treated as another observation we have a preference over, rather than a special signal.claim0.876
Abstract; central distinction.
Active inference agents can carry out epistemic exploration and account for uncertainty about their environment in a Bayes-optimal fashion.claim0.851
Abstract and §1, summarizing a key property.
Active inference offers an attractive natural adaptation mechanism for non-stationary environments due to its Bayesian model updating properties.claim0.847
§3, after non-stationary results.
Active inference provides a framework (derived from first principles) for solving and understanding the behavior of autonomous agents.claim0.844
In brief, active inference proposes that agents achieve this by optimising two complementary objective functions, a variational free energy and an expected free energy.quote0.844
Concise statement of the core hypothesis from Section 2.
How does active inference compare to reinforcement learning in environments with no rewards or uninformative prior preferences?question0.841
Core question addressed by the simulations when rewards are removed.
Active inference postulates that agents achieve survival by optimising two complementary objective functions, a variational free energy and an expected free energy.claim0.840
Core claim of active inference stated in Section 2.