Dopamine, reward learning, and active inference

ByThomas H. B. FitzGerald·Raymond J. Dolan·Karl Friston

DOI 10.3389/fncom.2015.00136 OpenAlex W2208959019

Original abstract (expand)

Temporal difference learning models propose phasic dopamine signalling encodes reward prediction errors that drive learning. This is supported by studies where optogenetic stimulation of dopamine neurons can stand in lieu of actual reward. Nevertheless, a large body of data also shows that dopamine is not necessary for learning, and that dopamine depletion primarily affects task performance. We offer a resolution to this paradox based on an hypothesis that dopamine encodes the precision of beliefs about alternative actions, and thus controls the outcome-sensitivity of behaviour. We extend an active inference scheme for solving Markov decision processes to include learning, and show that simulated dopamine dynamics strongly resemble those actually observed during instrumental conditioning. Furthermore, simulated dopamine depletion impairs performance but spares learning, while simulated excitation of dopamine neurons drives reward learning, through aberrant inference about outcome states. Our formal approach provides a novel and parsimonious reconciliation of apparently divergent experimental findings.

Related work— refs + corpus + external arXiv

Cited / in-corpus / arXiv badges show which signals surfaced each row. Multi-source rows weighted higher.

Learning Perception and Planning with Deep Active Inference
Tim Verbelen, Johannes Nauta, Cedric De Boom and Bart Dhoedt Ozan \c{C}atal
2020
≈ 80%
Reinforcement Learning through Active Inference
Beren Millidge, Anil K. Seth, Christopher L. Buckley Alexander Tschantz
2020
≈ 78%
Contrastive Active Inference
Pietro Mazzaglia and Tim Verbelen and Bart Dhoedt
2024
≈ 78%
Prior Preference Learning from Experts:Designing a Reward with Active Inference
Cheolhyeong Kim, Hyung Ju Hwang Jin young Shin
2021
≈ 77%
Expanding the Active Inference Landscape: More Intrinsic Motivations in the Perception-Action Loop
Christian Guckelsberger (2), Christoph Salge (3 and 4), Sim\'on C. Smith (4 and 5), Daniel Polani (4) ((1) Araya Inc., Tokyo, Japan, (2) Computational Creativity Group, Department of Computing, Goldsmiths, University of London, London, UK, (3) Game Innovation Lab, Department of Computer Science and Engineering, New York University, New York City, NY, USA, (4) Sepia Lab, Adaptive Systems Research Group, Department of Computer Science, University of Hertfordshire, Hatfield, UK, (5) Institute of Perception, Action and Behaviour, School of Informatics, The University of Edinburgh, UK) Martin Biehl (1)
2018
≈ 77%
Active inference and artificial reasoning
Lancelot Da Costa, Alexander Tschantz, Conor Heins, Christopher Buckley, Tim Verbelen, Thomas Parr Karl Friston
2025
≈ 77%
Active inference: demystified and compared
Philip J. Ball, Thomas Parr, Karl J. Friston Noor Sajid
2021
≈ 77%
Active Statistical Inference
Emmanuel J. Cand\`es Tijana Zrnic
2026
≈ 77%
A New Approach for Knowledge Generation Using Active Inference
Nazanin Movarraei Jamshid Ghasimi
2025
≈ 77%
Active inference: demystified and compared
in corpus
2021
≈ 76%
Active Inference on the Edge: A Design Study
Victor Casamayor Pujol, Praveen Kumar Donta, Schahram Dustdar Boris Sedlak
2023
≈ 76%
Active Inference or Control as Inference? A Unifying View
Abraham Imohiosen, Jan Peters Joe Watson
2020
≈ 76%
Active inference for action-unaware agents
Keisuke Suzuki, Ryota Kanai, Manuel Baltieri Filippo Torresan
2025
≈ 76%
Distributional Active Inference
Gulcin Baykal, Manuel Hau{\ss}mann, Mustafa Mert \c{C}elikok, Melih Kandemir Abdullah Akg\"ul
2026
≈ 76%
Deconstructing deep active inference
Th\'eophile Champion and Marek Grze\'s and Lisa Bonheme and Howard Bowman
2023
≈ 76%
Active Inference: A Process Theory
in corpus
2017
≈ 76%
Reward Maximisation through Discrete Active Inference
Noor Sajid, Thomas Parr, Karl Friston, Ryan Smith Lancelot Da Costa
2022
≈ 76%
Active inference on discrete state-spaces: a synthesis
in corpus
2020
≈ 75%
A Free energy principle for the brain (lecture summary)
in corpus
2008
≈ 74%
Why Learning Requires Feeling
in corpus
2026
≈ 71%
Active Inference, Curiosity and Insight
in corpus
2017
≈ 71%
A tale of two densities: active inference is enactive inference
in corpus
2020
≈ 71%
Learning without neurons in physical systems
in corpus
2022
≈ 69%
Natural Language Autoencoders Produce Unsupervised Explanations of LLM Activations
in corpus
≈ 68%
The Causally Emergent Alignment Hypothesis: Causal Emergence Aligns with and Predicts Final Reward in Reinforcement Learning Agents
in corpus
2026
≈ 68%
Active Inference with a Self-Prior in the Mirror-Mark Task
in corpus
2026
≈ 68%
Multiple ways to implement and infer sentience
in corpus
≈ 68%
Reasoning Theater: Disentangling Model Beliefs from Chain-of-Thought
in corpus
2026
≈ 67%
Exploration Through Introspection: A Self-Aware Reward Model
in corpus
2026
≈ 67%
Mechanistic Knobs in LLMs: Retrieving and Steering High-Order Semantic Features via Sparse Autoencoders
in corpus
2026
≈ 67%

Similar preprints — Semantic Scholar

Cited by (2)

Active Inference: A Process Theory
A single variational principle—minimizing variational free energy via gradient descent on a Markov decision process (MDP) generative model—is sufficient to derive neuronal dynamics that reproduce, wit
Active inference on discrete state-spaces: a synthesis
Active inference on discrete state-spaces, formalized as partially observable Markov decision processes (POMDPs) with likelihood matrix A, transition matrix B, and prior D, unifies perception, plannin