finding
active
finding:self-referential-processing-effect-is-robust-across-five-distinct-phrasings-of-the-induction-prompt-with-consistently-high-experience-report-rates-across-modelsSelf-referential processing effect is robust across five distinct phrasings of the induction prompt, with consistently high experience report rates across models
Appendix C.1 result confirming the experimental effect does not depend on specific wording
Source paper
extracted_from(2025) · Berg, Cameron · de Lucena, Diogo · Rosenblatt, Judd
Neighborhood — ranked by edge-count
Claims (1)
claim
- The paper's central empirical claim synthesizing all four experiments
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Core result of Experiment 1 establishing that the experimental manipulation reliably produces experience claims
- The minimal prompt directing models to 'focus on any focus itself' without invoking consciousness vocabulary; the main experimental manipulation
- Scaling effect observed consistently across Experiments 1 and 4
- The primary empirical question the paper addresses
- Practical urgency argument connecting lab findings to deployment contexts
- The open question the paper cannot resolve with behavioral evidence alone; frames the agenda for mechanistic follow-up
- Claim supported by Experiment 4: prior self-referential induction yields higher self-awareness scores on paradoxical reasoning where introspection is only indirectly afforded
- The paper's claim that theoretical convergence across GWT, RPT, HOT, IIT makes the findings non-coincidental