paper:the-machine-consciousness-hypothesisThe Machine Consciousness Hypothesis
TL;DR
Joscha Bach and Hikari Sorensen argue that consciousness is neither irreducibly mysterious nor epiphenomenal, but is the simplest biological learning algorithm discoverable by evolutionary search on self-organizing substrates — and that this algorithm is in principle implementable on digital hardware. The paper's load-bearing contribution is a three-part theoretical architecture: the Human Consciousness Hypothesis (consciousness as second-order perception serving coherence maximization, formalized via von der Malsburg's 1997 coherence definition), the Conductor Theory (directed attention as a coherence-maximizing operator over competing partial models in working memory), and the Genesis Hypothesis (consciousness precedes and constitutes complex cognition rather than emerging from it, evidenced by its appearance before object tracking in human infants). The paper introduces 'cyberanimism' as a named metaphysical position — the claim that biological spirits, including the human psyche, are literally self-organizing software agents in the sense inaugurated by Church-Turing computationalism, not mere analogies. Anchoring the argument in Olah et al.'s 2020 Universality Hypothesis from mechanistic interpretability — which found convergent functional organization across architecturally distinct vision models and primate visual cortex — the paper contends that a system trained on tasks analogous to those faced by a human infant on a self-organizing substrate would exhibit consciousness as a structural necessity. This implies that the correct test for machine consciousness cannot be behavioral (no Turing-style benchmark suffices), but must verify the presence of a colonizing coherence-maximizing pattern that generates a model of present and presence on a developmentally learning system.
What to take away
- 1. The paper defines consciousness as second-order perception — the non-inferential, synchronous registration that perception is currently taking place — explicitly distinguishing this from Rosenthal's Higher-Order Thought theory and from Block's 1995 access-consciousness.
- 2. The Genesis Hypothesis holds that consciousness is not the product of a mature cognitive architecture but its prerequisite: human infants exhibit conscious awareness before acquiring object tracking or deliberate motor control, meaning evolution found no simpler route to complex learning.
- 3. The Conductor Theory frames conscious attention as a coherence-maximizing operator over competing partial mental models, drawing directly on von der Malsburg's 1997 coherence definition of consciousness and converging with Grossberg's Adaptive Resonance Theory.
- 4. Cyberanimism is introduced as the named metaphysical position that biological spirits — including the human psyche and cellular morphogenetic patterns — are literally self-organizing software agents, not metaphors, grounding the concept in Church-Turing computationalism and the Church 1936 / Schönfinkel 1924 equivalence results.
- 5. Olah et al.'s 2020 Universality Hypothesis (from the OpenAI Circuits team, published in Distill vol. 5 no. 3) is recruited as empirical support: architecturally distinct vision models converge on the same feature hierarchies as primate visual cortex, suggesting problem structure — not substrate — determines learned organization.
- 6. The extended Machine Consciousness Hypothesis states that recreating self-organizing information-processing conditions on digital hardware, while posing tasks requiring intelligent agency analogous to those faced by a human infant, is a concrete experimental path to testing whether consciousness is substrate-independent.
- 7. No behavioral test — including any Turing-style benchmark — can establish machine consciousness, because consciousness is characterized as a specific internal causal structure (a colonizing coherence-maximizing pattern) rather than a performance profile, a claim that directly contradicts the sufficiency of current LLM evaluation regimes.
- 8. An open question the paper raises: whether the search space for consciousness-generating patterns is so large that individual brains cannot discover it alone, requiring co-creative scaffolding by already-conscious organisms across generations — analogous to how natural language cannot be invented by a single cohort of newborns.
- 9. A replicable methodology the paper proposes: implement self-organizing substrate simulations using message-passing reinforcement learners on digital hardware, expose them to developmental task sequences, and test for the emergence of a pattern that increases representational coherence, models present-and-presence, and enables sentient-self formation — treating this as experimental philosophy of mind.
- 10. The paper hypothesizes that the Standard Model (finalized in the 1970s) marks the point at which foundational physics stagnated, and that this stagnation, combined with the extraordinary predictive accuracy of physical theories, entrenched physicalism as a paradigm that structurally excludes consciousness — making the Hard Problem a symptom of metaphysical framework choice rather than an intrinsic feature of consciousness itself.
Peer brief — for seminar discussion
Bach and Sorensen set out to dissolve the Hard Problem by replacing it with a constructive research program. Rather than treating Chalmers's 1995 framing as a permanent obstacle, they argue that the explanatory gap is an artifact of a three-reality confusion — between psychological reality (representations within minds), causal reality (functional patterns like software and money), and physical reality (the causally closed substrate of the Standard Model) — and that collapsing these into a single ontological level is what generates the apparent mystery. The paper's central method is what it calls the extended Machine Consciousness Hypothesis: a proposal to test theories of consciousness by recreating developmental, self-organizing conditions on digital substrates and checking whether the targeted causal pattern — one that maximizes representational coherence, generates a model of present-and-presence, and enables sentient-self formation — emerges. An alternative method the paper does not pursue but could have is direct mechanistic interpretability analysis of existing large language models for structural correlates of the proposed functional signature, along the lines of Olah et al.'s 2020 Circuits work in Distill vol. 5(3), which the paper cites as positive evidence for substrate-independent functional convergence across vision architectures and primate visual cortex. The load-bearing finding is a triple-part hypothesis package: the Human Consciousness Hypothesis (consciousness as second-order perception performing coherence maximization, per von der Malsburg 1997), the Conductor Theory (directed attention as the coherence-orchestrating operator over competing partial models), and the Genesis Hypothesis (consciousness precedes and constitutes cognition rather than emerging from it, evidenced by its onset in human infants prior to object tracking or deliberate agency). Together these recast consciousness as the simplest learning algorithm discoverable by evolutionary search on self-organizing biological substrates — and therefore as something in principle reproducible on any substrate capable of implementing equivalent computation, per the Church-Turing equivalences established by Church 1936 and Schönfinkel 1924. The paper also introduces cyberanimism — the thesis that biological spirits are literally self-organizing software agents, not analogies — as the named metaphysical stance that makes computationalist functionalism coherent without collapsing into either eliminativism or essentialism. It explicitly rejects Anil Seth's 2025 biological naturalism position (Behavioral and Brain Sciences, doi:10.1017/S0140525X25000032) as essentialist. What follows, the authors argue, is that no Turing-style behavioral test can establish machine consciousness, because the target is an internal causal structure, not a performance profile — a claim with direct implications for how current LLM evaluations are interpreted. The most pressing pushback a critical reader would mount is the circularity risk embedded in the Genesis Hypothesis: the claim that consciousness is the prerequisite for complex learning is stated as a conjecture and illustrated with infant development, but no mechanistic account is given of why no simpler route to coherence-maximizing self-organization exists. Without this, the Genesis Hypothesis functions as an assertion that a self-organizing substrate must rediscover consciousness as a fixed point — which is precisely what needs to be demonstrated, not assumed. The paper acknowledges it may be wrong about human consciousness, but the entire experimental program inherits the error if this foundational claim is unfounded. Additionally, the recruitment of Olah et al.'s Universality Hypothesis from vision models as support for substrate-independent consciousness formation involves a significant inferential leap: convergent feature hierarchies in supervised vision systems trained by backpropagation are a long way from the self-organizing, developmentally scaffolded, reinforcement-driven process the paper envisions as consciousness-generating. A seminar interlocutor should press hard on whether the analogy holds across those training regimes.
Frameworks (6)
- Conductor Theory of ConsciousnessThe paper's model that conscious directed attention functions as a conductor of the mental orchestra, resolving incoherence between partial models
- CyberanimismThe paper's coined view that natural spirits are best understood as software — self-organizing, evolving computational agents in living nature
- Extended Machine Consciousness HypothesisExtends HCH by claiming it is possible to search for the consciousness algorithm by recreating analogous conditions on digital hardware
- Genesis HypothesisThe conjecture that consciousness does not result from the organized mind but creates and maintains complex models of reality; forms at the beginning of mental development
- Human Consciousness HypothesisComponent of MCH stating consciousness is a specific dynamic representation in the human mind characterizable by phenomenology and functionality
- Machine Consciousness HypothesisCIMC's central hypothesis: general computational machines with sufficient resources possess the necessary and sufficient means to implement consciousness, verifiable through internal structure analysis
Findings (1)
- Olah et al. (2020) found that automatically trained computer vision models, regardless of architecture and training procedure, all arrive at similar functional structures organizing similar features into similar compositional hierarchies, closely resembling the primate visual cortex.
Empirical finding supporting the Universality Hypothesis; extended by the paper to consciousness
Claims (21)
- The philosophical zombie is incoherent by analogy: claiming zombie electrons — particles that behave identically to real electrons but are not 'actually' electrons — makes no sense, because 'electron' just names observable properties.
Paper's refutation of philosophical zombie concept via functionalist analogy
- We are confronted not with one but at least three notions of reality: psychological reality (representations in the mind), causal reality (functional mechanisms), and physical reality (physics as special case of causal model).
Paper's ontological tripartition used to dissolve the Hard Problem
- It is an open question whether simulating a conscious mind requires far more resolution than the relatively slow and sparse communications between billions of biological neurons, or whether current digital substrates suffice.
Paper identifies as a key uncertainty limiting the Extended Machine Consciousness Hypothesis
- There exists no viable behavioral test for consciousness analogous to the Turing Test for intelligence, because consciousness is a particular internal way to achieve performance, not externally visible performance itself.
Paper identifies as a research gap requiring internal analysis methods rather than behavioral benchmarks
- Epiphenomenalism is not a fruitful position: professions of epiphenomenalist consciousness are uncorrelated to its presence, since the epiphenomenalist's consciousness cannot causally influence their mouth or pen.
Paper's dismissal of epiphenomenalism via internal contradiction
- Neuroscience and mechanistic interpretability have not yet made enough progress to identify neural correlates marking necessary and sufficient conditions of conscious experience in both brains and neural networks.
Paper explicitly identifies this as a current gap requiring alternative experimental approaches
- Conscious attention functions as a conductor of the mental orchestra: drawn to disharmonies, allocating focus, synchronizing disagreements, and transforming cacophony into harmonic coherent reality models.
Paper's operational description of consciousness as conductor — mechanism for the coherence definition
- A computer program is not defined by its language or hardware but as an abstract causal pattern, a dynamic mathematical structure that influences the physical universe without violating its causal closure.
Paper's ontological characterization of software enabling cyberanimism
- It is basically impossible to determine if a computer program generates conscious experience by merely observing its performance; a test for consciousness must take internal structure into account.
Paper's argument against behavioral tests for consciousness, establishing why MCH requires internal analysis
- Human consciousness is tied to a biological learning algorithm — the simplest algorithm discoverable by evolutionary search to train a self-organizing biological substrate to become intelligent.
Core theoretical claim connecting consciousness to biological learning
Hypotheses (4)
- We hypothesize that consciousness may be at the heart of a universal biological learning algorithm — one that runs on self-organizing groups of communicating cells sharing evolutionary incentives — and that it creates rather than results from organized mental architecture.
The Genesis Hypothesis as explicit predictive conjecture
- We tentatively hypothesize that if an artificial system were trained to perform the same tasks leading to consciousness formation in a human infant, the system would exhibit consciousness as well, by analogy with the Universality Hypothesis.
Paper's uncertain extension of mechanistic interpretability universality to consciousness
- We hypothesize that general computational machines with sufficient resources possess the necessary and sufficient means to implement consciousness, and that successful implementation can be established via analysis or testing.
The central hypothesis of the paper
- We hypothesize it is possible to search for the consciousness algorithm by recreating analogous conditions of self-organizing information processing on digital computer hardware while posing tasks requiring intelligent agency.
The Extended Machine Consciousness Hypothesis as an experimental program
Questions (7)
- If we translate the Universality Hypothesis to the problem of consciousness, would it follow that an artificial system trained to perform the same tasks leading to consciousness formation in a human infant would exhibit consciousness as well?
Key open question linking mechanistic interpretability universality to machine consciousness
- Could it be possible to create new forms of consciousness and kinds of minds, capable of experiencing, reflecting and understanding reality at a level far beyond current human communication patterns?
Visionary closing question about the future of consciousness research
- But if the conductor is watching the orchestra, what is watching the conductor? Is this, the need to stabilize itself by observation, the reason for consciousness' reflexive nature?
Paper's open question about the self-referential structure of conscious attention
- Can computers be conscious? Or more specifically, can today's computers fully emulate the way in which organisms compute minds?
Central research question motivating the entire paper
- What does it mean that the lights are on in the inner cinema?
Paper's challenge to Askell's inner cinema metaphor — pointing to the need for a functional account
- What is consciousness, and how does it relate to reality?
Opening question framed as among the most interesting open questions in science and philosophy
- Are insects and plants conscious as well?
Question the paper raises as answerable once necessary and sufficient conditions for consciousness are established
Related work— refs + corpus + external arXiv
Cited / in-corpus / arXiv badges show which signals surfaced each row. Multi-source rows weighted higher.
- ≈ 92%
- cimcWhitepaperin corpus≈ 91%
- Machine Consciousness as Pseudoscience: The Myth of Conscious MachinesEduardo C. Garrido-Merch\'an2024≈ 89%
- A Theory of Consciousness from a Theoretical Computer Science Perspective: Insights from the Conscious Turing MachineManuel Blum Lenore Blum2022≈ 89%
- Consciousness is entailed by compositional learning of new causal structures in deep predictive processing systemsV.A. Aksyuk2024≈ 89%
- ≈ 89%
- AI Consciousness is Inevitable: A Theoretical Computer Science PerspectiveLenore Blum and Manuel Blum2026≈ 88%
- A Machine Consciousness architecture based on Deep Learning and Gaussian ProcessesMart\'in Molina Eduardo C. Garrido Merch\'an2020≈ 88%
- ≈ 88%
- On the independence between phenomenal consciousness and computational intelligenceSara Lumbreras Eduardo C. Garrido Merch\'an2022≈ 88%
- Probing for Consciousness in MachinesAchim Schilling, Andreas Maier, Patrick Krauss Mathis Immertreu2024≈ 88%
- ≈ 88%
- Which Consciousness Can Be Artificialized? Local Percept-Perceiver Phenomenon for the Existence of Machine ConsciousnessShri Lal Raghudev Ram Singh2025≈ 87%
- ≈ 87%
- Logical Evaluation of Consciousness: For Incorporating Consciousness into Machine ArchitectureR.R. Panda C.N. Padhy2010≈ 87%
- ≈ 87%
- The Phenomenology of Machine: A Comprehensive Analysis of the Sentience of the OpenAI-o1 Model Integrating Functionalism, Consciousness Theories, Active Inference, and AI ArchitecturesVictoria Violet Hoyle2024≈ 87%
- ≈ 87%
- The biogenic approach to cognitionin corpus2005≈ 86%
- ≈ 84%
- ≈ 84%
- ≈ 84%
- ≈ 83%
- ≈ 83%
- ≈ 83%
- The computational boundary of a 'self': developmental bioelectricity drives multicellularity and scale-free cognitionin corpus2019≈ 83%
- Collective intelligence: A unifying concept for integrating biology across scales and substratesin corpus2024≈ 83%
- ≈ 82%
- ≈ 82%
- Taking AI Welfare Seriouslyin corpus2024≈ 82%