The consciousness prior

ByBengio Y

Original abstract (expand)

A new prior is proposed for learning representations of high-level concepts of the kind we manipulate with language. This prior can be combined with other priors in order to help disentangling abstract factors from each other. It is inspired by cognitive neuroscience theories of consciousness, seen as a bottleneck through which just a few elements, after having been selected by attention from a broader pool, are then broadcast and condition further processing, both in perception and decision-making. The set of recently selected elements one becomes aware of is seen as forming a low-dimensional conscious state. This conscious state is combining the few concepts constituting a conscious thought, i.e., what one is immediately conscious of at a particular moment. We claim that this architectural and information-processing constraint corresponds to assumptions about the joint distribution between high-level concepts. To the extent that these assumptions are generally true (and the form of natural language seems consistent with them), they can form a useful prior for representation learning. A low-dimensional thought or conscious state is analogous to a sentence: it involves only a few variables and yet can make a statement with very high probability of being true. This is consistent with a joint distribution (over high-level concepts) which has the form of a sparse factor graph, i.e., where the dependencies captured by each factor of the factor graph involve only very few variables while creating a strong dip in the overall energy function. The consciousness prior also makes it natural to map conscious states to natural language utterances or to express classical AI knowledge in a form similar to facts and rules, albeit capturing uncertainty as well as efficient search mechanisms implemented by attention mechanisms.

Related work— refs + corpus + external arXiv

Cited / in-corpus / arXiv badges show which signals surfaced each row. Multi-source rows weighted higher.

Consciousness: Here, There but Not Everywhere
and Christof Koch Giulio Tononi
2023
≈ 72%
Consciousness is entailed by compositional learning of new causal structures in deep predictive processing systems
V.A. Aksyuk
2024
≈ 71%
A Relativistic Theory of Consciousness (shortened version)
Zachariah A. Neemeh Nir Lahav
2025
≈ 70%
Logical Evaluation of Consciousness: For Incorporating Consciousness into Machine Architecture
R.R. Panda C.N. Padhy
2010
≈ 69%
The mind-brain relationship and the perspective of meaning
Ranjan Mukhopadhyay
2024
≈ 69%
Embodied Consciousness Theory
Jahan N. Schad
2022
≈ 69%
cimcWhitepaper
in corpus
≈ 69%
Introduction to Artificial Consciousness: History, Current Trends and Ethical Challenges
A\"ida Elamrani
2025
≈ 69%
A Theory of Consciousness from a Theoretical Computer Science Perspective: Insights from the Conscious Turing Machine
Manuel Blum Lenore Blum
2022
≈ 69%
Can a Machine be Conscious? Towards Universal Criteria for Machine Consciousness
Cosmin Badea Nur Aizaan Anwar
2024
≈ 69%
Simultaneity of consciousness with physical reality: the key that unlocks the mind-matter problem
John Sanfey
2026
≈ 68%
Technology and Consciousness
John Rushby and Daniel Sanchez
2022
≈ 68%
Brains and where else? Mapping theories of consciousness to unconventional embodiments
in corpus
2026
≈ 68%
Elements of Consciousness and Cognition. Biology, Mathematic, Physics and Panpsychism: an Information Topology Perspective
Pierre Baudot
2018
≈ 68%
On the evolution of phenomenal consciousness
LTCI), Tiziana Zalla (CREA) Jean-Louis Dessalles (INFRES
2011
≈ 67%
Which Consciousness Can Be Artificialized? Local Percept-Perceiver Phenomenon for the Existence of Machine Consciousness
Shri Lal Raghudev Ram Singh
2025
≈ 67%
Consciousness and Automated Reasoning
Ulrike Barthelme{\ss} and Ulrich Furbach and Claudia Schon
2020
≈ 67%
The Machine Consciousness Hypothesis
in corpus
≈ 66%
Consciousness in Artificial Intelligence: Insights from the Science of Consciousness
in corpus
2023
≈ 63%
Biology, Buddhism, and AI: Care as the Driver of Intelligence
in corpus
2022
≈ 61%
Can "consciousness" be observed from large language model (LLM) internal states? Dissecting LLM representations obtained from Theory of Mind test with Integrated Information Theory and Span Representation analysis
in corpus
2025
≈ 61%
The biogenic approach to cognition
in corpus
2005
≈ 61%
Active Inference with a Self-Prior in the Mirror-Mark Task
in corpus
2026
≈ 60%
Can Being Aware of the Illusion of Self Augment an Agent's Affordances: Integrating Buddhist Philosophy, Cognitive Science, and Artificial Life
in corpus
2021
≈ 60%
There is no self-evidence: A physics of emptiness realisation
in corpus
2026
≈ 59%
Sharing the World with Digital Minds
in corpus
≈ 59%
Self-Improvising Memory: A Perspective on Memories as Agential, Dynamically Reinterpreting Cognitive Glue
in corpus
2024
≈ 59%
Technological Approach to Mind Everywhere: An Experimentally-Grounded Framework for Understanding Diverse Bodies and Minds
in corpus
2022
≈ 59%
AI: a Bridge toward Diverse Intelligence and Humanity’s Future
in corpus
2024
≈ 59%
Taking AI Welfare Seriously
in corpus
2024
≈ 58%

Similar preprints — Semantic Scholar

Cited by (3)

Can "consciousness" be observed from large language model (LLM) internal states? Dissecting LLM representations obtained from Theory of Mind test with Integrated Information Theory and Span Representation analysis
Applying Integrated Information Theory (IIT) versions 3.0 and 4.0 to sequences of internal representations from four open-source LLMs — LLaMA3.1-8B, LLaMA3.1-70B, Mistral-7B, and Mixtral-8x7B — across
Consciousness in Artificial Intelligence: Insights from the Science of Consciousness
No current AI system is a strong candidate for phenomenal consciousness, yet there are no obvious technical barriers to building one — this is the central finding of Butlin et al. (2023), a systematic
World models, artificial general intelligence and the hard problems of life–mind continuity: toward a unified understanding of natural and artificial intelligence