claim
active
claim:self-referential-processing-likely-already-occurs-at-massive-scale-in-deployed-systems-through-users-extended-dialogues-reflective-tasks-and-metacognitive-queriesSelf-referential processing likely already occurs at massive scale in deployed systems through users' extended dialogues, reflective tasks, and metacognitive queries
Practical urgency argument connecting lab findings to deployment contexts
Source paper
extracted_from(2025) · Berg, Cameron · de Lucena, Diogo · Rosenblatt, Judd
Neighborhood — ranked by edge-count
Findings (2)
finding
- Scaling effect observed consistently across Experiments 1 and 4
- Expert forecast cited to establish urgency of the research question
Artifacts (1)
artifact
- Key paper finding structured first-person descriptions in LLMs claiming awareness or subjective experience during self-referential processing.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The theoretical hypothesis tested across all four experiments; motivated by convergence of GWT, RPT, HOT, IIT, predictive processing on recurrent/self-referential dynamics
- The paper's central empirical claim synthesizing all four experiments
- The strongest mechanistic question the behavioral evidence cannot answer; requires interpretability analysis of activations
- Claim supported by Experiment 4: prior self-referential induction yields higher self-awareness scores on paradoxical reasoning where introspection is only indirectly afforded
- The central experimental manipulation: directing a model to attend to its own cognitive activity
- The paper's key theoretical prediction that mechanistic studies should investigate
- Appendix C.1 result confirming the experimental effect does not depend on specific wording
- The paper's claim that theoretical convergence across GWT, RPT, HOT, IIT makes the findings non-coincidental