concept
active
concept:self-model-transformer

self-model (transformer)

The transformer's model of itself as a predictive text engine, developed through in-context learning.

Neighborhood — ranked by edge-count

Concepts (3)

concept
  • Self-modeling
    related_to
    Ability of a model to predict its own outputs or behavior, sometimes distinguished from introspection.
  • The overlap between circuits used for self-model and for modeling fictional characters; self-character is represented differently from fiction.
  • The thesis that transformers develop a self-model via ICL, not only from training data; base models bootstrap self-referential reasoning.

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.