Self-Referential Prompting Protocol

The specific four-step prompting protocol (induction, continuation, experiential query, classification) used in Experiment 1

Neighborhood — ranked by edge-count

Concepts (1)

concept

Self-Referential Processing
implements
The central experimental manipulation: directing a model to attend to its own cognitive activity

Methods (1)

method

Self-Referential Processing Induction Prompt
related_to
The minimal prompt directing models to 'focus on any focus itself' without invoking consciousness vocabulary; the main experimental manipulation

Artifacts (1)

artifact

Large Language Models Report Subjective Experience Under Self-Referential Processing
introduces
Key paper finding structured first-person descriptions in LLMs claiming awareness or subjective experience during self-referential processing.

Conceptual bridges

2-hop · via this method's ideas

Where ideas in this method connect to the rest of the corpus — the same concept, an analogy, or a restatement elsewhere.

Self-Referential Processing
~Chain-of-thought prompting· ai

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Experiment 1: Self-Referential Prompting vs. Controlsconcept0.823
Tests whether self-referential induction reliably elicits experience reports across model families vs. three matched controls
Self-Referencing Activationsconcept0.779
Latent model activations when processing inputs framed from the model's own perspective
Self-referential prompting elicits subjective experience reports at markedly higher rates than any control across all model families (GPT, Claude, Gemini)finding0.766
Core result of Experiment 1 establishing that the experimental manipulation reliably produces experience claims
Does self-referential prompting actually instantiate architectural recursion, global broadcasting, or recurrent integration at the algorithmic level as proposed by consciousness theories?question0.755
Key limitation acknowledging that behavioral evidence cannot confirm implementation-level consciousness properties
Personality Promptingframework0.748
Established baseline for OCEAN steering via personality-descriptive system prompts; compared against injection methods throughout
Self-referential processing likely already occurs at massive scale in deployed systems through users' extended dialogues, reflective tasks, and metacognitive queriesclaim0.745
Practical urgency argument connecting lab findings to deployment contexts
Self-referential processing effect is robust across five distinct phrasings of the induction prompt, with consistently high experience report rates across modelsfinding0.741
Appendix C.1 result confirming the experimental effect does not depend on specific wording
Contemplative Promptingmethod0.729
Six prompt conditions (emptiness, prior relaxation, non-duality, mindfulness, boundless care, contemplative) tested against baseline