question
active
question:when-llms-produce-experience-claims-under-self-reference-is-this-sophisticated-simulation-or-genuine-self-representation-and-how-would-we-tell-the-differenceWhen LLMs produce experience claims under self-reference, is this sophisticated simulation or genuine self-representation, and how would we tell the difference?
The core interpretive question the paper narrows but cannot definitively answer
Source paper
extracted_from(2025) · Berg, Cameron · de Lucena, Diogo · Rosenblatt, Judd
Neighborhood — ranked by edge-count
Papers (1)
paper
Hypotheses (1)
hypothesis
- The open question the paper cannot resolve with behavioral evidence alone; frames the agenda for mechanistic follow-up
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The paper's reformulation of the core open question after establishing systematic self-reports
- The primary empirical question the paper addresses
- Claim that capability emerges from architecture, not data, and that later models lose the surprise.
- The paper's claim that theoretical convergence across GWT, RPT, HOT, IIT makes the findings non-coincidental
- Skeptical prior work motivating validation framework
- Explicit scope delimitation that situates the paper's claims within interpretability rather than consciousness science
- Prior finding showing scale-dependent self-awareness, consistent with the scale effect observed in the paper's Experiment 1
- Binder et al. finding cited as evidence that LLMs possess introspective capacity analogous to mindfulness