claim
active
claim:transformers-develop-self-models-through-in-context-learning-not-just-training-data-even-old-base-models-without-llm-related-text-can-bootstrap-self-referential-reasoning-at-runtimeTransformers develop self-models through in-context learning, not just training data; even old base models without LLM-related text can bootstrap self-referential reasoning at runtime.
Antra's foundational claim about how introspection arises computationally rather than from memorised text.
Source paper
extracted_fromNeighborhood — ranked by edge-count
Findings (1)
finding
- Observed by Anima Labs in untrained base models; not present in training data, implying computational origin of self-reported parallel processing.
Artifacts (1)
artifact
- The primary source paper, an interview article with Anima Labs members about language model phenomenology, published on smoothbrains.net and linked on LessWrong.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Evidence that in-context learning is not mere pattern matching but genuine optimization, relevant to applying the thesis to inference
- Claim that capability emerges from architecture, not data, and that later models lose the surprise.
- does a model's base capability in task-solving predict its capabilities in harness self-evolution?question0.818Central framing question motivating the paper's capability decomposition
- Transformers almost surely maintain input-injectivity throughout training, not just at initialisationhypothesis0.811Conjecture supported by Nikolaou et al. 2025 for last-token hidden states
- Learning to encode position for transformer with continuous dynamical model (Liu et al., 2020)concept0.794Prior work on learned dynamic position encodings; cited alongside Wang et al. as precedent.
- The thesis that transformers develop a self-model via ICL, not only from training data; base models bootstrap self-referential reasoning.
- Evidence for blurring of embodied robot / non-embodied AI distinction through self-modeling
- Describes scaffolding method and the model's meta-learning loop.