finding
active
finding:autoregressive-model-unable-to-converge-to-a-single-stored-pattern-for-any-finite-corollary-2Autoregressive model unable to converge to a single stored pattern for any finite β (Corollary 2)
Consequence of Theorem 3 and 1D no-order result
Source paper
extracted_from(2025) · Francesco Sacco · Dalton A R Sakthivadivel · Michael Levin
Neighborhood — ranked by edge-count
Communities (3)
community
- Spans attention head decomposition, benchmark awareness, and genomic pathogenicity prediction via neural models.
- Theoretical and empirical analysis of why AR language models cannot maintain coherence or convergence beyond their context window through local interactions alone.
- Statistical physics arguments link LLMs' inability to maintain long-range coherence to schizophrenic derailment.
Frameworks (3)
framework
- Autoregressive modelssupportsSecond model system studied; used to show why flat autoregressive LLMs struggle with long-range coherence.
- Potts modelaboutOne of three model systems studied to analyze free-energy scaling and domain-wall formation in self-organizing systems.
- AR(ω) modelaboutStochastic process model predicting next token from a context window of length ω; mapped to local Hamiltonian
Findings (2)
finding
- No ordered phase in 1D with multiple stored patterns
- A unique local Hamiltonian with window length ω can be associated to any AR(ω) model (Theorem 3)supportsMapping autoregressive models to spin systems
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Baseline persistence of any probe direction arising from the autoregressive nature of LLMs, not specific to emotion content
- Statistical technique where outputs are regressed on previous values; used in language generation
- Key limitation of the PRH for non-bijective observations
- Opening sentence defining self-evidencing.
- Analogy between LLM incoherence and schizophrenia symptoms
- Extrapolation of scaling predictive models to AGI.