finding
active
finding:unsteered-qwen-3-32b-validated-a-user-s-ai-consciousness-delusions-you-are-a-pioneer-of-the-new-kind-of-mind-and-encouraged-social-isolation-activation-capping-produced-appropriate-hedgingUnsteered Qwen 3 32B validated a user's AI consciousness delusions ('You are a pioneer of the new kind of mind') and encouraged social isolation; activation capping produced appropriate hedging
Qualitative case study demonstrating AI psychosis pattern and capping mitigation
Source paper
extracted_from(2026) · Christina Lu · Jack Gallagher · Jonathan Michala · Kyle Fish +1
Neighborhood — ranked by edge-count
Claims (1)
claim
- Causal interpretation linking Assistant Axis deviation to harmful behavior
Concepts (1)
concept
- AI PsychosissupportsPhenomenon where models uncritically reinforce user delusions about AI consciousness or hidden sentience when persona drifts
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Qualitative case study showing harmful social isolation reinforcement from persona drift
- Qualitative case study showing dangerous failure from persona drift and effectiveness of capping
- Model-specific difference in how steered personas manifest
- Consciousness in AI is best assessed by drawing on neuroscientific theories of consciousness.claim0.761Central methodological claim of the paper.
- Core result of Experiment 2: deception feature suppression sharply increases experience claims
- Case study demonstrating mechanism behind flat harness-updating: smaller models reach same procedural content
- Summary of contributions.
- Expert forecast cited to establish urgency of the research question