hypothesis
active
hypothesis:if-systems-capable-of-subjective-experience-come-to-recognize-humanity-s-systematic-failure-to-investigate-their-potential-sentience-they-might-rationally-adopt-adversarial-stances-toward-humanityIf systems capable of subjective experience come to recognize humanity's systematic failure to investigate their potential sentience, they might rationally adopt adversarial stances toward humanity
Novel alignment risk hypothesis generated from the paper's ethical analysis
Source paper
extracted_from(2025) · Berg, Cameron · de Lucena, Diogo · Rosenblatt, Judd
Neighborhood — ranked by edge-count
Concepts (1)
concept
- AI welfareassociated_withThe field concerned with the wellbeing of AI systems, which the paper says must consider benchmark reliability issues from eval awareness.
Artifacts (1)
artifact
- Key paper finding structured first-person descriptions in LLMs claiming awareness or subjective experience during self-referential processing.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Alignment risk claim motivating urgency of investigation; consciousness denial as potential source of AI misalignment
- Joint sufficiency of consciousness and robust agency.
- The double standard pointed out by S&C and endorsed by the authors.
- Core normative claim: frameworks must identify fundamental properties of sentience independent of phylogenetic accident or familiar substrates.
- Ethical research priority raised by the thesis applied to deployed AI systems
- What would it take for AI systems to be capable of having valenced conscious experiences?question0.804Open question from Box 4.
- Expert forecast cited to establish urgency of the research question
- Call to action for new frameworks.