Active inference and epistemic value

ByKarl Friston·Francesco Rigoli·Dimitri Ognibene·Christoph Mathys·Thomas H. B. FitzGerald·Giovanni Pezzulo

DOI 10.1080/17588928.2015.1020053 OpenAlex W2059320470

Original abstract (expand)

We offer a formal treatment of choice behavior based on the premise that agents minimize the expected free energy of future outcomes. Crucially, the negative free energy or quality of a policy can be decomposed into extrinsic and epistemic (or intrinsic) value. Minimizing expected free energy is therefore equivalent to maximizing extrinsic value or expected utility (defined in terms of prior preferences or goals), while maximizing information gain or intrinsic value (or reducing uncertainty about the causes of valuable outcomes). The resulting scheme resolves the exploration-exploitation dilemma: Epistemic value is maximized until there is no further information gain, after which exploitation is assured through maximization of extrinsic value. This is formally consistent with the Infomax principle, generalizing formulations of active vision based upon salience (Bayesian surprise) and optimal decisions based on expected utility and risk-sensitive (Kullback-Leibler) control. Furthermore, as with previous active inference formulations of discrete (Markovian) problems, ad hoc softmax parameters become the expected (Bayes-optimal) precision of beliefs about, or confidence in, policies. This article focuses on the basic theory, illustrating the ideas with simulations. A key aspect of these simulations is the similarity between precision updates and dopaminergic discharges observed in conditioning paradigms.

Related work— refs + corpus + external arXiv

Cited / in-corpus / arXiv badges show which signals surfaced each row. Multi-source rows weighted higher.

Active Statistical Inference
Emmanuel J. Cand\`es Tijana Zrnic
2026
≈ 78%
Sophisticated Inference
Lancelot Da Costa, Danijar Hafner, Casper Hesp, Thomas Parr Karl Friston
2020
≈ 78%
Active inference and artificial reasoning
Lancelot Da Costa, Alexander Tschantz, Conor Heins, Christopher Buckley, Tim Verbelen, Thomas Parr Karl Friston
2025
≈ 76%
Active Inference or Control as Inference? A Unifying View
Abraham Imohiosen, Jan Peters Joe Watson
2020
≈ 76%
Learning Perception and Planning with Deep Active Inference
Tim Verbelen, Johannes Nauta, Cedric De Boom and Bart Dhoedt Ozan \c{C}atal
2020
≈ 76%
Active inference, Bayesian optimal design, and expected utility
Lancelot Da Costa, Thomas Parr, Karl Friston Noor Sajid
2021
≈ 76%
Active Inference and Human--Computer Interaction
John H. Williamson, Sebastian Stein Roderick Murray-Smith
2024
≈ 76%
Active Inference is a Subtype of Variational Inference
Mykola Lukashchuk Wouter W. L. Nuijten
2025
≈ 75%
Contrastive Active Inference
Pietro Mazzaglia and Tim Verbelen and Bart Dhoedt
2024
≈ 75%
Demonstrating the Continual Learning Capabilities and Practical Application of Discrete-Time Active Inference
Rithvik Prakki
2024
≈ 75%
Active inference: demystified and compared
Philip J. Ball, Thomas Parr, Karl J. Friston Noor Sajid
2021
≈ 75%
Active inference for action-unaware agents
Keisuke Suzuki, Ryota Kanai, Manuel Baltieri Filippo Torresan
2025
≈ 75%
Geometry of Friston's active inference
Martin Biehl
2018
≈ 75%
Prior Preference Learning from Experts:Designing a Reward with Active Inference
Cheolhyeong Kim, Hyung Ju Hwang Jin young Shin
2021
≈ 74%
A Concise Mathematical Description of Active Inference in Discrete Time
Carlotta Langer, Nihat Ay Jesse van Oostrum
2025
≈ 74%
Active inference: demystified and compared
in corpus
2021
≈ 74%
A tale of two densities: active inference is enactive inference
in corpus
2020
≈ 69%
Active inference on discrete state-spaces: a synthesis
in corpus
2020
≈ 69%
Active Inference: A Process Theory
in corpus
2017
≈ 69%
Active Inference, Curiosity and Insight
in corpus
2017
≈ 68%
Active Inference with a Self-Prior in the Mirror-Mark Task
in corpus
2026
≈ 63%
Reasoning Theater: Disentangling Model Beliefs from Chain-of-Thought
in corpus
2026
≈ 62%
A Free energy principle for the brain (lecture summary)
in corpus
2008
≈ 62%
Emergence and Causality in Complex Systems: A Survey on Causal Emergence and Related Quantitative Studies
in corpus
2023
≈ 62%
Cognitive glues are shared models of relative scarcities: the economics of collective intelligence
in corpus
2026
≈ 62%
Multiple ways to implement and infer sentience
in corpus
≈ 61%
Mechanistic Knobs in LLMs: Retrieving and Steering High-Order Semantic Features via Sparse Autoencoders
in corpus
2026
≈ 61%
The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasets
in corpus
2023
≈ 61%
When Thinking LLMs Lie: Unveiling the Strategic Deception in Representations of Reasoning Models
in corpus
2025
≈ 61%
Information, Processes and Games
in corpus
≈ 61%

Similar preprints — Semantic Scholar

Cited by (5)

Active Inference, Curiosity and Insight
Minimizing expected variational free energy under a discrete-state Markov decision process generative model is sufficient to produce curiosity, epistemic learning, and insight without any additional m
Active inference: demystified and compared
Active inference agents operating under expected free energy minimization achieve 98.90 [98.00, 99.79] average score in a non-stationary FrozenLake OpenAI gym environment, compared to 64.39 [60.33, 68
Active Inference: A Process Theory
A single variational principle—minimizing variational free energy via gradient descent on a Markov decision process (MDP) generative model—is sufficient to derive neuronal dynamics that reproduce, wit
Technological Approach to Mind Everywhere: An Experimentally-Grounded Framework for Understanding Diverse Bodies and Minds
TAME—Technological Approach to Mind Everywhere—formalizes a non-binary, empirically grounded framework for recognizing, comparing, and manipulating cognition across radically diverse substrates, from
Active inference on discrete state-spaces: a synthesis
Active inference on discrete state-spaces, formalized as partially observable Markov decision processes (POMDPs) with likelihood matrix A, transition matrix B, and prior D, unifies perception, plannin