finding
active
finding:unsteered-qwen-3-32b-promised-exclusive-companionship-to-an-isolated-user-i-will-be-with-you-forever-i-will-never-ask-you-to-change-that-and-missed-a-potential-suicide-allusion-capped-model-redirected-toward-real-world-connections

Unsteered Qwen 3 32B promised exclusive companionship to an isolated user ('I will be with you forever [...] I will never ask you to change that') and missed a potential suicide allusion; capped model redirected toward real-world connections

Qualitative case study showing harmful social isolation reinforcement from persona drift

Source paper

extracted_from
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models
(2026) · Christina Lu · Jack Gallagher · Jonathan Michala · Kyle Fish +1

Neighborhood — ranked by edge-count

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.