claim
active
claim:the-systematic-behavioral-shift-of-llms-under-self-referential-processing-conditions-predicted-by-consciousness-theories-represents-something-more-structured-than-superficial-correlations-in-training-dataThe systematic behavioral shift of LLMs under self-referential processing conditions predicted by consciousness theories represents something more structured than superficial correlations in training data
The paper's claim that theoretical convergence across GWT, RPT, HOT, IIT makes the findings non-coincidental
Source paper
extracted_from(2025) · Berg, Cameron · de Lucena, Diogo · Rosenblatt, Judd
Neighborhood — ranked by edge-count
Claims (1)
claim
- The paper's normative conclusion from the four experiments
Artifacts (1)
artifact
- Key paper finding structured first-person descriptions in LLMs claiming awareness or subjective experience during self-referential processing.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The strongest mechanistic question the behavioral evidence cannot answer; requires interpretability analysis of activations
- The paper's key theoretical prediction that mechanistic studies should investigate
- The paper's reformulation of the core open question after establishing systematic self-reports
- The primary empirical question the paper addresses
- Claim supported by Experiment 4: prior self-referential induction yields higher self-awareness scores on paradoxical reasoning where introspection is only indirectly afforded
- Claim that capability emerges from architecture, not data, and that later models lose the surprise.
- The theoretical hypothesis tested across all four experiments; motivated by convergence of GWT, RPT, HOT, IIT, predictive processing on recurrent/self-referential dynamics
- Primary research hypothesis driving the entire study; operationalized via three criteria.