paper
active
2022
paper:doi-10-48550-arxiv-2206-05831

Learning without neurons in physical systems

TL;DR

Physical systems — elastic networks, flow networks, molecular assemblies, and creased sheets — can autonomously learn desired input-output behaviors through purely local learning rules, without any computer or centralized optimizer, a framework Stern and Murugan term 'physical learning.' The core theoretical advance is demonstrating that local rules of the form dw[x,t]/dt ∼ h(s(f;{w})[x,t]) can collectively minimize a global cost function because the physical response s encodes global information locally, bridging the apparent gap between locality and global optimization. Contrastive learning in resistor networks (Dillavou et al. 2021) successfully classifies the 4-feature Iris dataset with accuracy reaching 97% by step 300, and learning capacity in 2D molecular self-assembly systems scales as the square root of the number of distinct molecular species. A Hebbian molecular learning rule — dwij/dt ∼ si(x,t)sj(x,t) — enables self-assembling systems built from 2,500 molecular species to perform pattern recognition on 2,500-pixel images through stochastic nucleation dynamics. Physical learning also leaves diagnostic signatures in its substrates: trained systems develop spatial heterogeneity in elastic moduli, prune unused network edges, generate anomalously few energy minima relative to random disordered systems, and acquire soft modes that reduce response dimensionality. The paper argues these findings imply that physical learning provides a principled, model-free route to inverse-problem solving in materials, that local rules are not fundamentally inferior to global gradient descent, and that the physical signatures of learning — heterogeneity, soft modes, reduced landscape complexity — may serve as empirical markers for identifying naturally evolved or learned physical systems in biology and materials science.

What to take away

  1. 1. Local learning rules of the form dw[x,t]/dt ∼ h(s(f;{w})[x,t]) — where a learning degree of freedom changes only based on the state of the system in its own spacetime vicinity — are sufficient to collectively minimize a global cost function because the physical response encodes global information locally.
  2. 2. Contrastive learning in an experimental resistor network (Dillavou et al. 2021, arXiv:2108.00275) classifies the 4-feature, 3-class Iris dataset, achieving accuracy that rises from ~83% at step 1 to ~97% at step 300, demonstrating supervised physical learning without any computer in the learning loop.
  3. 3. A Hebbian molecular learning rule dwij/dt ∼ si(x,t)sj(x,t), implemented via proximity-based ligation, enables a 2,500-molecular-species system to perform pattern recognition on 2,500-pixel images through stochastic non-equilibrium nucleation dynamics (Zhong, Schwab, Murugan 2017, J. Stat. Phys. 167:806).
  4. 4. Learning capacity in 2D molecular self-assembly systems scales as the square root of the number of distinct molecular species (Murugan et al. 2015, PNAS 112:54), so doubling the number of species increases capacity by a factor of ~1.41.
  5. 5. Even when the learning timescale τlearn is comparable to the physical response timescale τresponse (ratio R = 1), contrastive learning in flow networks still converges successfully; only at rates significantly exceeding R = 1 do underdamped oscillations emerge that may prevent convergence (Stern et al. 2022, Phys. Rev. Research 4:L022037).
  6. 6. Trained physical systems develop exponentially fewer energy minima than random disordered systems of the same size; in learned molecular self-assembly, spurious chimeric minima near capacity are structurally analogous to spurious states in Hopfield associative memory (Hopfield 1982, PNAS 79:2554).
  7. 7. Supervised 'thumbs-up/thumbs-down' learning in disordered creased sheets — where the supervisor selects the sign of the unsupervised rule by placing the sheet in one of two chemical environments — is sufficient to classify subtle spatial force-pattern correlations mapped from the Iris dataset (Stern et al. 2020, PNAS 117:14843).
  8. 8. To replicate contrastive learning experimentally, a researcher can couple two copies of the same resistor network (a 'free' and a 'clamped' network), update each variable resistor Ri by ΔRi = +δR if |ΔViC| > |ΔViF| and −δR otherwise, and drive the system with voltage inputs encoding Iris dataset features, as demonstrated in Wycoff et al. 2022 (J. Chem. Phys. 156:144903).
  9. 9. An open question raised by the paper is whether non-equilibrium physical systems — those without time-reversal symmetry, such as molecular motor-driven cytoskeletal networks — are intrinsically more expressive or show greater parameter degeneracy for a given behavior, which could make them easier to train than equilibrium systems.
  10. 10. Physical learning produces a diagnostic signature of reduced response dimensionality: trained elastic and flow networks develop soft modes (low-eigenvalue normal modes) so that responses to arbitrary random forces are confined to a low-dimensional subspace, a feature absent in randomly parameterized networks of the same topology (Tlusty, Libchaber, Eckmann 2017, Phys. Rev. X 7:021037).

Peer brief — for seminar discussion

Stern and Murugan's 2022 review consolidates and theoretically frames an emerging program in which physical materials — disordered elastic networks, flow networks, creased sheets, and molecular assemblies — autonomously solve inverse problems by experiencing examples of desired behavior, modifying their own parameters through local dynamical rules, with no computer involved in the adaptation loop. The central organizing concept, which they call physical learning, is formalized around three ingredients: physical degrees of freedom that map inputs to states, learning degrees of freedom (bond stiffnesses, tube radii, molecular interaction strengths) that modulate those mappings, and a learning rule dw[x,t]/dt ∼ h(s(f;{w})[x,t]) that updates each learning element based only on the local state of the system at that spacetime point. The load-bearing theoretical claim is that such purely local rules can nevertheless minimize a global cost function, because the physical response of the system encodes global information in local state variables through collective dynamics — no explicit gradient computation is required. The most quantitatively grounded demonstrations span several substrates. A contrastive learning rule implemented in an experimental resistor network (Dillavou et al. 2021) classifies the 4-feature Iris dataset, improving from roughly 83% accuracy at training step 1 to 97% at step 300. A Hebbian molecular rule (dwij/dt ∼ si·sj) applied to a 2,500-species chemical system performs pattern recognition on 2,500-pixel images via stochastic nucleation (Zhong et al. 2017, J. Stat. Phys. 167:806). Learning capacity in 2D molecular self-assembly scales as the square root of the number of distinct species (Murugan et al. 2015, PNAS 112:54). Separate work shows that contrastive learning in flow networks converges even when the learning timescale matches the physical response timescale (Stern et al. 2022, Phys. Rev. Research 4:L022037), relaxing a constraint that in silico ML takes for granted. The framework introduced to unify these results is the classification of physical learning rules into unsupervised (Hebbian, flux-based, directed aging) and supervised (thumbs-up/thumbs-down, contrastive Hebbian) categories, with the contrastive rule identified as the most powerful physically realizable approach to supervised learning. An alternative method the review explicitly sets aside is computer-aided backpropagation as implemented in Wright et al. 2022 (Nature 601:549), which uses physical systems as substrates but retains a computer in the weight-update loop; the review treats this as a related but distinct program. The paper also identifies physical signatures that learning leaves in its substrate — spatial heterogeneity in moduli, topological pruning of unused edges, anomalously few energy minima relative to random disordered systems, and the emergence of soft modes that confine responses to a low-dimensional subspace — and proposes these as empirical markers for detecting learning or evolution in natural physical systems. The prediction the review advances is that non-equilibrium physical systems lacking time-reversal symmetry may prove more expressive than equilibrium ones, because non-equilibrium nucleation dynamics already demonstrably improve pattern-recognition capacity, and non-reciprocal interactions could unlock dynamic-phase learning not accessible at steady state. A critical reader would push back on the scope conflation between genuinely autonomous physical learning and systems that still require substantial human supervision to implement the learning rule. The 'thumbs-up/thumbs-down' protocol in creased sheets requires a human supervisor to decide the sign of the rule for each training example and to physically place the material in different chemical environments accordingly; the contrastive rule in resistor networks requires constructing two coupled copies of the network and engineering the update circuit. These are meaningful engineering achievements, but calling them 'physical learning without computers' overstates autonomy. The degree to which the training protocol itself can be internalized by the physical system — rather than externally administered — remains largely unresolved experimentally, and the review's framing tends to blur this distinction.

Methods (3)

  • Local learning rule
    Learning rule where change in a parameter at point x,t depends only on system state at same or nearby spacetime points, without requiring global cost function computation
  • Molecular Hebbian learning
    Unsupervised learning rule in molecular systems where species i,j with high co-localized concentrations strengthen their interaction strength through proximity-based ligation
  • Spike-Timing-Dependent Plasticity
    Biologically plausible local learning rule constraining the brain; referenced as precedent for locality-constrained learning in physical systems

Frameworks (4)

  • Contrastive learning
    Supervised learning framework where system learns by observing contrast between current response and nudged improved response; requires weak additional forces from supervisor
  • Neuromorphic computing
    Related field where physical elements are modified for desired computational ability; traditionally targets symbolic inputs/outputs unlike physical learning's physical stimuli/responses
  • Physical learning
    Framework for solving inverse problems in which physical systems autonomously adapt their parameters in response to stimuli through local learning rules, without requiring computational design or explicit cost functions
  • Reservoir Computing
    Physical computation framework using fixed complex dynamical systems with trained output filter; contrasted with physical learning's parameter modification approach

Findings (1)

Claims (2)

Hypotheses (1)

Questions (1)

Original abstract (expand)

Learning is traditionally studied in biological or computational systems. This review explores 'physical learning' in which physical systems adopt desirable properties on their own without computational design through local learning rules and autonomous adaptation. Physical learning machines provide practical advantages over computer-designed systems by not requiring an accurate model and their ability to autonomously adapt to changing needs, while also offering a novel perspective on how physical constraints modify abstract learning theory.

Related work— refs + corpus + external arXiv

Cited / in-corpus / arXiv badges show which signals surfaced each row. Multi-source rows weighted higher.

+28 more

Similar preprints — Semantic Scholar

Cited by (1)

Cross-corpus bridges (5)

same_concept_as · Nomic cosine

External markdown files that talk about the same concept as this entity.

  • aboutblank_kb
    How can distributed learning in multi-scale biological systems increase system-level welfare without system-level reward signals?questions/how-can-distributed-learning-in-multiscale-biological-systems.md0.811
  • aboutblank_kb
    How can neural systems adapt their dynamics as new behaviors are learned in novel bodies and environments?questions/how-can-neural-systems-adapt-their-dynamics-as.md0.807
  • aboutblank_kb
    Can learning mechanisms in biological systems be understood through principles of machine learning and optimization?questions/can-learning-mechanisms-in-biological-systems-be-understood.md0.801
  • aboutblank_kb
    Hebbian Learningconcepts/ai/hebbian-learning.md0.801
  • aboutblank_kb
    How do neural systems adapt their dynamics as new behaviors are learned in novel embodiments?questions/how-do-neural-systems-adapt-their-dynamics-as.md0.782