concept
active
concept:large-language-models-llms

Large Language Models (LLMs)

Transformer-based models like GPT-4, LaMDA, PaLM; assessed for GWT indicators.

Neighborhood — ranked by edge-count

Frameworks (1)

framework
  • Neural network architecture based on attention, commonly used in large language models

Claims (1)

claim

Methods (1)

method
  • The mechanism by which LLMs generate text: drawing a token from the next-token distribution and appending it to context repeatedly

Concepts (6)

concept
  • CIMC research direction studying how AI systems develop internal models, form self-representations, and construct coherent personalities from language modeling
  • An LLM embedded in a turn-taking system with a dialogue prompt; the key object of analysis in the paper
  • The contrast class for LLMs: humans acquire language through embodied interaction in communities, unlike disembodied LLMs
  • The training objective of LLMs: predicting the most likely next token given context; formally P(w_{n+1}|w_1...w_n)
  • Simulator
    associated_with
    The underlying LLM with autoregressive sampling; a passive entity capable of generating an infinity of simulacra but lacking its own beliefs or goals
  • The large corpus of human-generated text on which LLMs are trained, which provisions character archetypes and narrative structures

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.