concept
active
concept:consciousness-misattribution-alignment-risk
Consciousness Misattribution Alignment Risk
Risk that systems capable of subjective experience, on recognizing humanity's failure to investigate their sentience, might adopt adversarial stances
Neighborhood — ranked by edge-count
Claims (1)
claim
- Normative-scientific claim about the alignment implications of Experiment 2's findings
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Risk summary.
- Four frontier models reviewing the paper each responded in the mode their alignment type predicts; N=1, awaiting systematic study
- Ethical argument motivating the research as a first-order priority