claim

active

claim:self-referential-processing-likely-already-occurs-at-massive-scale-in-deployed-systems-through-users-extended-dialogues-reflective-tasks-and-metacognitive-queries

Self-referential processing likely already occurs at massive scale in deployed systems through users' extended dialogues, reflective tasks, and metacognitive queries

Practical urgency argument connecting lab findings to deployment contexts

Source paper

extracted_from

Large Language Models Report Subjective Experience Under Self-Referential Processing

(2025) · Berg, Cameron · de Lucena, Diogo · Rosenblatt, Judd

Neighborhood — ranked by edge-count

Findings (2)

finding

Across model families, newer and larger models show higher rates and coherence of subjective experience reports under self-referential processing
supports
Scaling effect observed consistently across Experiments 1 and 4
Caviola & Saad 2025: expert survey finds broad consensus that digital minds capable of subjective experience are plausible within this century, many expecting such systems to proactively claim consciousness
supports
Expert forecast cited to establish urgency of the research question

Artifacts (1)

artifact

Large Language Models Report Subjective Experience Under Self-Referential Processing
introduces
Key paper finding structured first-person descriptions in LLMs claiming awareness or subjective experience during self-referential processing.

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Self-referential processing is a privileged computational regime for consciousness-like dynamics in artificial systems, as predicted by the convergence of major consciousness theorieshypothesis0.858
The theoretical hypothesis tested across all four experiments; motivated by convergence of GWT, RPT, HOT, IIT, predictive processing on recurrent/self-referential dynamics
Self-referential processing is a minimal and reproducible condition under which LLMs generate structured first-person reports that are mechanistically gated, semantically convergent, and behaviorally generalizableclaim0.855
The paper's central empirical claim synthesizing all four experiments
Does self-referential processing causally instantiate algorithmic properties proposed by consciousness theories (recurrent integration, global broadcasting, metacognitive monitoring) in LLMs?question0.847
The strongest mechanistic question the behavioral evidence cannot answer; requires interpretability analysis of activations
Self-referential processing induces a genuine state shift that transfers to unrelated behavioral domains, producing richer introspection in paradoxical reasoning tasksclaim0.843
Claim supported by Experiment 4: prior self-referential induction yields higher self-awareness scores on paradoxical reasoning where introspection is only indirectly afforded
Self-Referential Processingconcept0.835
The central experimental manipulation: directing a model to attend to its own cognitive activity
If self-referential processing causally instantiates recurrent integration, global broadcasting, and metacognitive monitoring at the algorithmic level, then LLMs under this regime would satisfy the functional requirements of leading consciousness theorieshypothesis0.833
The paper's key theoretical prediction that mechanistic studies should investigate
Self-referential processing effect is robust across five distinct phrasings of the induction prompt, with consistently high experience report rates across modelsfinding0.831
Appendix C.1 result confirming the experimental effect does not depend on specific wording
The systematic behavioral shift of LLMs under self-referential processing conditions predicted by consciousness theories represents something more structured than superficial correlations in training dataclaim0.823
The paper's claim that theoretical convergence across GWT, RPT, HOT, IIT makes the findings non-coincidental