paper
active
2008
8
paper:friston-2008-free-energy-brain-college-de-france

A Free energy principle for the brain (lecture summary)

TL;DR

Free energy minimization unifies action and perception as two faces of a single optimization principle: an agent suppresses surprise by either updating its internal model (perception) or by acting on the world to sample only expected sensory states (action). Derived from the probabilistic behavior of an ensemble of agents belonging to the same phenotypic class, the free energy bound approximates the log-evidence (marginal likelihood) of a generative model, making it formally equivalent to negative surprise and negative value simultaneously. The lecture series introduces Dynamic Expectation Maximisation (DEM), a variational filtering scheme that inverts nonlinear dynamic causal models in generalised coordinates of motion and yields both time-dependent conditional state densities and time-independent parameter densities, explicitly superseding Kalman and particle filtering for online Bayesian inversion. Presented across three sessions at the Collège de France in May–June 2008, the framework grounds perception in Helmholtz's neural-energy constructs extended through Empirical Bayes and hierarchical generative models, and reframes dopamine not as a reward signal per se but as encoding the conditional precision—certainty—of predictions, consistent with its role in balancing bottom-up sensory drive against top-down empirical priors. This implies that classical and operant conditioning introduce statistical regularities that are learned by the same hierarchical inference machinery used for causal structure, and that rewards are simply predictable (low-surprise) stimuli, making value learning a special case of perceptual inference rather than an independent computational faculty.

What to take away

  1. 1. Action and perception are mathematically equivalent under a single free-energy principle: both minimize a variational bound on the log-evidence of an agent's generative model of its sensory inputs.
  2. 2. Free energy, surprise, and negative value are formally identical quantities, meaning maximizing reward and minimizing sensory surprise are the same computation.
  3. 3. The paper introduces Dynamic Expectation Maximisation (DEM), a variational scheme that performs online Bayesian inversion of nonlinear dynamic causal models in generalised coordinates of motion.
  4. 4. DEM furnishes time-dependent conditional state densities and time-independent parameter densities simultaneously, a capability Kalman filtering and particle filtering do not provide in a unified online framework.
  5. 5. The free-energy bound is equivalent to the model's marginal likelihood (log-evidence), directly enabling model selection and Bayesian model averaging without additional approximations.
  6. 6. Dopamine is hypothesised to encode the conditional precision (certainty) of predictions rather than reward per se, modulating the balance between bottom-up sensory signals and top-down empirical priors during perceptual inference.
  7. 7. Classical and operant conditioning paradigms are recast as procedures that introduce statistical regularities into the sensorium, learned via the same hierarchical Empirical Bayes machinery used for ordinary causal inference.
  8. 8. The framework is derived from first principles by considering the probabilistic behaviour of an ensemble of agents sharing the same phenotype, grounding it in population-level thermodynamic reasoning rather than postulated objectives.
  9. 9. Hierarchical generative models are the substrate the framework requires of the brain: they allow context-sensitive, dynamic construction of prior expectations rather than fixed priors, which is a replicable architectural commitment for computational modelling.
  10. 10. An open question the lectures raise is whether all neurobiological value and reinforcement substrates can be fully accounted for within the perceptual-inference hierarchy, or whether dedicated value circuitry retains explanatory independence that free-energy minimisation cannot subsume.

Peer brief — for seminar discussion

Friston's 2008 Collège de France lecture summary, spanning three sessions on 29–30 May and 1 June 2008, formalises a single variational principle—free-energy minimisation—that absorbs both perceptual inference and action-selection as special cases. The central claim is that an agent's free energy constitutes a tractable upper bound on the surprise of its sensory exchanges with the environment, and that this bound equals the negative log-evidence of the agent's generative model. Because surprise, free energy, and negative value are numerically identical under this formulation, acting to maximise reward and acting to minimise prediction error are not competing frameworks but the same gradient descent expressed on different variables. The load-bearing technical contribution is Dynamic Expectation Maximisation (DEM), a variational filtering algorithm that inverts nonlinear dynamic causal models by optimising free energy in generalised coordinates of motion, yielding time-dependent conditional state densities and time-independent parameter densities in a single online pass. DEM is explicitly contrasted with Kalman filtering and particle filtering, both of which handle either states or parameters but not both simultaneously in a principled Bayesian way. The hierarchical generative model architecture underpinning DEM instantiates Helmholtz's neural-energy ideas via Empirical Bayes, allowing context-sensitive top-down priors to be constructed dynamically rather than fixed. Two specific mechanistic predictions follow. First, dopamine's functional role is recast: rather than signalling scalar reward prediction error, it encodes the conditional precision—the inverse variance—of predictions, thereby gating the relative weight of bottom-up sensory evidence versus top-down priors. Second, classical and operant conditioning are predicted to be limiting cases of the same hierarchical causal-inference algorithm, with rewards defined simply as statistically predictable (low-surprise) stimuli and aversive events as intrinsically surprising ones. A critical reader would push back on the scope of the precision-weighting dopamine claim: the 2008 summary cites no quantitative neural data or model fits against electrophysiology, making it an interpretive reframing rather than a tested prediction. The framework could in principle have been evaluated here using existing dynamic causal modelling datasets from fMRI or local-field-potential recordings—an alternative the summary does not engage. More broadly, collapsing value and surprise into one scalar sidesteps the question of how the agent specifies which sensory states count as expected (i.e., how phenotypically appropriate priors are set), a degrees-of-freedom problem that the 3-page summary leaves unresolved and which subsequent literature has contested extensively.

Methods (3)

Frameworks (1)

  • Free Energy Principle
    A foundational variational principle from statistical physics that formalizes how self-organizing systems maintain structural integrity and adapt to their environment by minimizing free energy—a mathematical bound on surprise or prediction error. Originally developed by Karl Friston, the framework unifies action, perception, and learning as processes of active inference, where systems both update internal models of the world and act upon it to reduce the divergence between predictions and observations.

Claims (9)

Original abstract (expand)

Action, perception and free-energy (Thursday May 29th) Value-learning and perceptual learning have been an important focus over the past decade, attracting the concerted attention of experimental psychologists, neurobiologists and the machine learning community. Despite some formal connections; e.g., the role of prediction error in optimising some function of sensory states, both fields have developed their own rhetoric and postulates. In work, we show that perceptual learning is, literally,...

Related work— refs + corpus + external arXiv

Cited / in-corpus / arXiv badges show which signals surfaced each row. Multi-source rows weighted higher.

Similar preprints — Semantic Scholar

Cross-corpus bridges (12)

same_concept_as · Nomic cosine

External markdown files that talk about the same concept as this entity.

  • aboutblank_kb
    Free Energy Principle And Active Inferenceframeworks/free-energy-principle-and-active-inference.md0.881
  • aboutblank_kb
    Free Energy Principleframeworks/free-energy-principle.md0.862
  • aboutblank_kb
    Does the Free Energy Principle adequately explain morphogenesis and pattern formation in biological systems?questions/does-the-free-energy-principle-adequately-explain-morphogenesis.md0.853
  • aboutblank_kb
    Active Inferenceframeworks/active-inference.md0.850
  • aboutblank_kb
    Surprise Minimization Frameworkframeworks/surprise-minimization-framework.md0.840
  • aboutblank_kb
    Free Energy Minimizationframeworks/free-energy-minimization.md0.836
  • aboutblank_kb
    Free Energy Principleconcepts/cognitive/free-energy-principle.md0.835
  • aboutblank_kb
    Free-Energy Principle Applied To Morphogenesisframeworks/free-energy-principle-applied-to-morphogenesis.md0.821
  • aboutblank_kb
    Can the Free Energy Principle be extended to explain consciousness itself, and what role does predictive processing play in subjective experience?questions/can-the-free-energy-principle-be-extended-to.md0.817
  • aboutblank_kb
    Free-Energy Approach To Pattern Regulationconcepts/systems/free-energy-approach-to-pattern-regulation.md0.804
  • aboutblank_kb
    Free-Energy Approach To Pattern Regulationframeworks/free-energy-approach-to-pattern-regulation.md0.797
  • alexander
    An association-based model of dynamic behaviourpapers/extracted/2022-04-05_Stefan-Lesser_tr2011003_abmdb.pdf_993054.md0.753