concept
active
concept:gpt-4-1

GPT-4.1

OpenAI model tested in Experiments 1, 3, and 4; shows a 100% experience-reporting rate under self-referential induction

Neighborhood — ranked by edge-count

Concepts (4)

concept
  • GPT-4
    related_to
    Large language model underlying ChatGPT and Bing Chat; used for illustrative quotes in the paper
  • GPT-3
    related_to
    Large language model cited as an example; also used in Andreas 2022 for preliminary evidence
  • GPT-4V
    related_to
    Example of unified multimodal system handling both images and text with a combined architecture
  • GPT-4 Turbo
    related_to
    OpenAI model tested; shows no alignment faking due to insufficient detailed reasoning

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • GPT-2 · concept · 0.838
    Early large language model cited as an example of transformer-based LLMs
  • GPT-4 was used to generate unique variations of cheap/expensive items and room names for the test dataset
  • Is GPT corrigible? · question · 0.780
    Disambiguation exercise.
  • GPT-OSS 120B · concept · 0.779
    One of two large reasoning models analyzed in the paper for performative vs genuine CoT behavior
  • Frontier LLM used at temperature 0 to score SJT responses on 1-5 Likert scale conditioned on construct definition and SJT stem
  • A family of large language models trained on next-token prediction, central example of simulators.
  • Disambiguation exercise.
  • IIT 4.0 · framework · 0.765
    Version 4.0 of IIT, used to compute Φ and Φ-structure from LLM representation networks; latest iteration at time of study.
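
The "Related by similarity" listing above keeps entities whose cosine similarity to this one is at least 0.65 and that have no typed edge to it. A minimal sketch of that filter follows, assuming each entity has an embedding vector and typed edges are stored as name pairs. The function names, the data layout, and the toy vectors are hypothetical and are not taken from the system that produced this page.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def similar_without_edge(target, embeddings, typed_edges, threshold=0.65):
    """Return (name, score) pairs for entities similar to `target` that
    share no typed edge with it, sorted by descending score.

    embeddings  -- dict mapping entity name to its vector (assumed layout)
    typed_edges -- iterable of (name_a, name_b) pairs, treated as undirected
    """
    # Entities that already have a typed relation to the target are excluded.
    linked = set()
    for a, b in typed_edges:
        if a == target:
            linked.add(b)
        elif b == target:
            linked.add(a)

    target_vec = embeddings[target]
    hits = []
    for name, vec in embeddings.items():
        if name == target or name in linked:
            continue
        score = cosine(target_vec, vec)
        if score >= threshold:
            hits.append((name, score))
    return sorted(hits, key=lambda pair: pair[1], reverse=True)


# Toy data: vectors are made up purely for illustration.
toy_embeddings = {
    "GPT-4.1": [1.0, 0.0],
    "GPT-2": [1.0, 0.1],     # close to the target, no typed edge
    "GPT-4": [1.0, 1.0],     # similar, but already linked by a typed edge
    "IIT 4.0": [1.0, 1.0],   # similar enough, no typed edge
    "Unrelated": [0.0, 1.0], # below the threshold
}
toy_edges = [("GPT-4.1", "GPT-4")]

candidates = similar_without_edge("GPT-4.1", toy_embeddings, toy_edges)
```

In this sketch the entries returned are the candidates for new edges or duplicate checks; anything already connected by a typed edge is filtered out before scoring.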