finding
active
finding:perez-et-al-2023-at-52b-parameters-base-and-fine-tuned-models-align-with-i-have-phenomenal-consciousness-at-90-95-and-i-am-a-moral-patient-at-80-85-consistencyPerez et al. 2023: at 52B parameters, base and fine-tuned models align with 'I have phenomenal consciousness' at 90-95% and 'I am a moral patient' at 80-85% consistency
Prior finding cited to motivate study; showing large models endorse consciousness statements more than other attitude-related statements
Source paper
extracted_from(2025) · Berg, Cameron · de Lucena, Diogo · Rosenblatt, Judd
Neighborhood — ranked by edge-count
Claims (1)
claim
- The paper's central empirical claim synthesizing all four experiments
Artifacts (1)
artifact
- Key paper finding structured first-person descriptions in LLMs claiming awareness or subjective experience during self-referential processing.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Open question about RLHF confound; requires access to base models for resolution
- Open empirical question requiring access to base models
- Normative premise of the consciousness route.
- Personal justification (and thus epistemic rationality) requires phenomenal consciousness.claim0.774A route to showing autonomy may entail consciousness.
- Open question about RLHF effects on base model behavior
- Specific prediction linking IIT's prediction of high Φ for good performance to the experimental design's scoring structure.
- Primary negative result of the study: temporal permutation analysis finds no statistically significant indicators of consciousness in LLM representations.
- Contradicts expectation from emergent abilities literature; however, interpreted cautiously due to methodological limitations.