Input–output maps are strongly biased towards simple outputs

By Kamaludin Dingle · Chico Q. Camargo · Ard A. Louis

DOI 10.1038/s41467-018-03101-6 OpenAlex W2794316594

Original abstract

Many systems in nature can be described using discrete input-output maps. Without knowing details about a map, there may seem to be no a priori reason to expect that a randomly chosen input would be more likely to generate one output over another. Here, by extending fundamental results from algorithmic information theory, we show instead that for many real-world maps, the a priori probability P(x) that randomly sampled inputs generate a particular output x decays exponentially with the approximate Kolmogorov complexity $\tilde{K}(x)$ of that output. These input-output maps are biased towards simplicity. We derive an upper bound $P(x) \lesssim 2^{-a\tilde{K}(x) - b}$, which is tight for most inputs. The constants a and b, as well as many properties of P(x), can be predicted with minimal knowledge of the map. We explore this strong bias towards simple outputs in systems ranging from the folding of RNA secondary structures to systems of coupled ordinary differential equations to a stochastic financial trading model.
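The simplicity bias described in the abstract can be explored numerically on a toy map. The sketch below is illustrative only and is not the authors' code: `eca_map` is a hypothetical input-output map (8 input bits select an elementary cellular automaton rule, and the output is the history of the centre cell), and `lz78_phrase_count` is a crude Lempel-Ziv-style stand-in for the approximate complexity $\tilde{K}(x)$. The function `simplicity_bound` evaluates a bound of the form $2^{-a\tilde{K}(x) - b}$; the values of a and b are left as free parameters, since the sketch makes no attempt to predict them.

```python
from collections import Counter
from itertools import product


def lz78_phrase_count(s: str) -> int:
    """Count phrases in an LZ78-style parse of s (assumed complexity proxy)."""
    phrases = set()
    current = ""
    count = 0
    for ch in s:
        current += ch
        if current not in phrases:
            phrases.add(current)
            count += 1
            current = ""
    # A leftover partial phrase still counts as one more unit.
    return count + (1 if current else 0)


def eca_map(rule_bits: str, steps: int = 16, width: int = 33) -> str:
    """Hypothetical toy map: 8 bits (MSB first) pick a cellular automaton rule.

    The automaton starts from a single live cell with periodic boundaries;
    the output is the centre cell's value at each of `steps` time steps.
    """
    rule = {format(i, "03b"): rule_bits[7 - i] for i in range(8)}
    cells = ["0"] * width
    cells[width // 2] = "1"
    history = []
    for _ in range(steps):
        history.append(cells[width // 2])
        cells = [
            rule[cells[i - 1] + cells[i] + cells[(i + 1) % width]]
            for i in range(width)
        ]
    return "".join(history)


def output_probabilities(map_fn, n_bits: int) -> dict:
    """Enumerate all 2**n_bits inputs and return P(x) for each output x."""
    counts = Counter(
        map_fn("".join(bits)) for bits in product("01", repeat=n_bits)
    )
    total = 2 ** n_bits
    return {x: c / total for x, c in counts.items()}


def simplicity_bound(k: float, a: float, b: float) -> float:
    """Evaluate 2**(-a*k - b); a and b are assumed, not fitted, here."""
    return 2.0 ** (-a * k - b)


if __name__ == "__main__":
    probs = output_probabilities(eca_map, n_bits=8)
    # List the most probable outputs next to their complexity proxy.
    for x, p in sorted(probs.items(), key=lambda kv: -kv[1])[:8]:
        print(x, f"P={p:.4f}", f"K~={lz78_phrase_count(x)}")
```

Running the script lists the most frequent outputs of the toy map alongside their phrase counts, which gives a quick way to check whether frequent outputs tend to be the more compressible ones. Whether this particular toy map satisfies the conditions under which the bound is expected to hold is not established here.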

Related work: refs + corpus + external arXiv

Cited / in-corpus / arXiv badges show which signals surfaced each row. Multi-source rows are weighted higher.

An Evaluation on Large Language Model Outputs: Discourse and Memorization
Adrian de Wynter, Xun Wang, Alex Sokolov, Qilong Gu, Si-Qing Chen
2026
≈ 70%
The Non-Linear Representation Dilemma: Is Causal Abstraction Enough for Mechanistic Interpretability?
in corpus
2025
≈ 70%
Input Order Shapes LLM Semantic Alignment in Multi-Document Summarization
Jing Ma
2025
≈ 70%
When Fair Ranking Meets Uncertain Inference
Avijit Ghosh, Ritam Dutt, Christo Wilson
2026
≈ 69%
Towards Unified Attribution in Explainable AI, Data-Centric AI, and Mechanistic Interpretability
Shichang Zhang, Tessa Han, Usha Bhalla, Himabindu Lakkaraju
2025
≈ 69%
Analyze Feature Flow to Enhance Interpretation and Steering in Language Models
Daniil Laptev, Nikita Balagansky, Yaroslav Aksenov, Daniil Gavrilov
2025
≈ 69%
A Unified View of Causal and Non-causal Feature Selection
Kui Yu, Lin Liu, Jiuyong Li
2018
≈ 69%
CLIPDraw: Exploring Text-to-Drawing Synthesis through Language-Image Encoders
Kevin Frans, L.B. Soros, Olaf Witkowski
2021
≈ 69%
(Im)possibility of Collective Intelligence
Krikamol Muandet
2025
≈ 69%
Mechanistic Indicators of Steering Effectiveness in Large Language Models
Mehdi Jafari, Hao Xue, Flora Salim
2026
≈ 68%
Causal Interpretation of Sparse Autoencoder Features in Vision
Sangyu Han, Yearim Kim, Nojun Kwak
2025
≈ 68%
Locating Demographic Bias at the Attention-Head Level in CLIP's Vision Encoder
Alaa Yasser, Kittipat Phunjanna, Marcos Escudero Viñolo, Catarina Barata, Jenny Benois-Pineau
2026
≈ 68%
Improving Steering Vectors by Targeting Sparse Autoencoder Features
Sviatoslav Chalnev, Matthew Siu, Arthur Conmy
2024
≈ 68%
Recommender systems, stigmergy, and the tyranny of popularity
Zackary Okun Dunivin, Paul E. Smaldino
2025
≈ 68%
Interpretability and Generalization Bounds for Learning Spatial Physics
Alejandro Francisco Queiruga, Theo Gutman-Solo, Shuai Jiang
2026
≈ 68%
Visual Representations inside the Language Model
Benlin Liu, Amita Kamath, Madeleine Grunde-McLaughlin, Winson Han, Ranjay Krishna
2025
≈ 68%
The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasets
in corpus
2023
≈ 66%
Model Alignment Search
in corpus
2025
≈ 66%
Mechanistic Knobs in LLMs: Retrieving and Steering High-Order Semantic Features via Sparse Autoencoders
in corpus
2026
≈ 66%
Every Good Regulator of a System Must Be a Model of That System
in corpus
1991
≈ 66%
The Platonic Representation Hypothesis
in corpus
2024
≈ 66%
Why Learning Requires Feeling
in corpus
2026
≈ 65%
Janus Information Flow Transformers
in corpus
2025
≈ 65%
Emergence and Causality in Complex Systems: A Survey on Causal Emergence and Related Quantitative Studies
in corpus
2023
≈ 65%
Active inference: demystified and compared
in corpus
2021
≈ 65%
Addressing divergent representations from causal interventions on neural networks
in corpus
2025
≈ 65%
Psychological Steering of Large Language Models
in corpus
2026
≈ 64%

Cited by (2)

The Platonic Representation Hypothesis
Neural networks trained on different data modalities, architectures, and objectives are converging toward a shared statistical model of reality — what the paper terms the "platonic representation" — f
The computational boundary of a 'self': developmental bioelectricity drives multicellularity and scale-free cognition
Scale-Free Cognition, the framework introduced here, proposes that any coherent Self is demarcated by a 'cognitive light cone'—a spatio-temporal boundary of events a system can measure, model, and att