Reinforcement Learning or Active Inference?

ByKarl Friston·Jean Daunizeau·Stefan J. Kiebel

DOI 10.1371/journal.pone.0006421 OpenAlex W2137411342

Original abstract (expand)

This paper questions the need for reinforcement learning or control theory when optimising behaviour. We show that it is fairly simple to teach an agent complicated and adaptive behaviours using a free-energy formulation of perception. In this formulation, agents adjust their internal states and sampling of the environment to minimize their free-energy. Such agents learn causal structure in the environment and sample it in an adaptive and self-supervised fashion. This results in behavioural policies that reproduce those optimised by reinforcement learning and dynamic programming. Critically, we do not need to invoke the notion of reward, value or utility. We illustrate these points by solving a benchmark problem in dynamic programming; namely the mountain-car problem, using active perception or inference under the free-energy principle. The ensuing proof-of-concept may be important because the free-energy formulation furnishes a unified account of both action and perception and may speak to a reappraisal of the role of dopamine in the brain.

Related work— refs + corpus + external arXiv

Cited / in-corpus / arXiv badges show which signals surfaced each row. Multi-source rows weighted higher.

Reinforcement Learning through Active Inference
Beren Millidge, Anil K. Seth, Christopher L. Buckley Alexander Tschantz
2020
≈ 83%
Active inference: demystified and compared
Philip J. Ball, Thomas Parr, Karl J. Friston Noor Sajid
2021
≈ 82%
Distributional Active Inference
Gulcin Baykal, Manuel Hau{\ss}mann, Mustafa Mert \c{C}elikok, Melih Kandemir Abdullah Akg\"ul
2026
≈ 81%
Contrastive Active Inference
Pietro Mazzaglia and Tim Verbelen and Bart Dhoedt
2024
≈ 81%
Active Inference and Reinforcement Learning: A unified inference on continuous state and action spaces under partial observability
Parvin Malekzadeh and Konstantinos N. Plataniotis
2024
≈ 80%
Active inference: demystified and compared
in corpus
2021
≈ 80%
Active Statistical Inference
Emmanuel J. Cand\`es Tijana Zrnic
2026
≈ 80%
Learning Perception and Planning with Deep Active Inference
Tim Verbelen, Johannes Nauta, Cedric De Boom and Bart Dhoedt Ozan \c{C}atal
2020
≈ 80%
Prior Preference Learning from Experts:Designing a Reward with Active Inference
Cheolhyeong Kim, Hyung Ju Hwang Jin young Shin
2021
≈ 79%
Active Inference or Control as Inference? A Unifying View
Abraham Imohiosen, Jan Peters Joe Watson
2020
≈ 78%
Bayesian policy selection using active inference
Johannes Nauta, Tim Verbelen, Pieter Simoens and Bart Dhoedt Ozan \c{C}atal
2019
≈ 78%
Expanding the Active Inference Landscape: More Intrinsic Motivations in the Perception-Action Loop
Christian Guckelsberger (2), Christoph Salge (3 and 4), Sim\'on C. Smith (4 and 5), Daniel Polani (4) ((1) Araya Inc., Tokyo, Japan, (2) Computational Creativity Group, Department of Computing, Goldsmiths, University of London, London, UK, (3) Game Innovation Lab, Department of Computer Science and Engineering, New York University, New York City, NY, USA, (4) Sepia Lab, Adaptive Systems Research Group, Department of Computer Science, University of Hertfordshire, Hatfield, UK, (5) Institute of Perception, Action and Behaviour, School of Informatics, The University of Edinburgh, UK) Martin Biehl (1)
2018
≈ 77%
Online reinforcement learning with sparse rewards through an active inference capsule
Charel van Hoof (1), Beren Millidge (2) ((1) Delft University of Technology, (2) University of Oxford) Alejandro Daniel Noel (1)
2021
≈ 77%
Deconstructing deep active inference
Th\'eophile Champion and Marek Grze\'s and Lisa Bonheme and Howard Bowman
2023
≈ 77%
Active inference for action-unaware agents
Keisuke Suzuki, Ryota Kanai, Manuel Baltieri Filippo Torresan
2025
≈ 77%
Reward Maximisation through Discrete Active Inference
Noor Sajid, Thomas Parr, Karl Friston, Ryan Smith Lancelot Da Costa
2022
≈ 77%
Active Inference: A Process Theory
in corpus
2017
≈ 71%
Active inference on discrete state-spaces: a synthesis
in corpus
2020
≈ 70%
A tale of two densities: active inference is enactive inference
in corpus
2020
≈ 70%
Active Inference, Curiosity and Insight
in corpus
2017
≈ 68%
The Causally Emergent Alignment Hypothesis: Causal Emergence Aligns with and Predicts Final Reward in Reinforcement Learning Agents
in corpus
2026
≈ 68%
SFR-DeepResearch: Towards Effective Reinforcement Learning for Autonomously Reasoning Single Agents
in corpus
2025
≈ 67%
Natural Language Autoencoders Produce Unsupervised Explanations of LLM Activations
in corpus
≈ 66%
Multiple ways to implement and infer sentience
in corpus
≈ 65%
Simulators — LessWrong
in corpus
≈ 65%
Why Learning Requires Feeling
in corpus
2026
≈ 65%
A Free energy principle for the brain (lecture summary)
in corpus
2008
≈ 65%
Active Inference with a Self-Prior in the Mirror-Mark Task
in corpus
2026
≈ 65%
Reasoning Theater: Disentangling Model Beliefs from Chain-of-Thought
in corpus
2026
≈ 64%
Learning without neurons in physical systems
in corpus
2022
≈ 64%

Similar preprints — Semantic Scholar

Cited by (1)

Active inference on discrete state-spaces: a synthesis
Active inference on discrete state-spaces, formalized as partially observable Markov decision processes (POMDPs) with likelihood matrix A, transition matrix B, and prior D, unifies perception, plannin