quote
active
quote:the-problem-isn-t-that-it-is-a-transformer-the-problem-is-that-it-is-an-auto-regressive-llm-auto-regressive-llms-that-compute-each-token-with-a-fixed-number-of-computational-steps-can-t-reason-regardless-of-the-details-of-the-architectureThe problem isn't that it is a transformer. The problem is that it is an auto-regressive LLM. Auto-regressive LLMs that compute each token with a fixed number of computational steps can't reason, regardless of the details of the architecture.
LeCun's post on X supporting the view that fixed-step probabilistic prediction precludes consciousness in LLMs.
Source paper
extracted_from(2025) · Li, Jingkai
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Core summary of Janus' position on autoregressive recurrence enabling introspection.
- Antra's foundational claim about how introspection arises computationally rather than from memorised text.
- Claim formalizing the Anima Labs idea that transformers are effectively recurrent due to K/V stream.
- Primary conclusion of the study based on temporal permutation analysis failing all three criteria.
- Open question about the nature of the abundant token-in-context features found
- Authors argue the prevalence of token-in-context features reflects genuine model computation rather than dictionary learning artifact
- Assessment from case study.
- Transformer can be viewed as a Wolfram causal graph with foliations specifying computation order.claim0.758Janus's interpretive framing of transformers as causal graphs.