finding
active
finding:gemma-2-27b-is-unlikely-to-take-on-human-personas-when-steered-away-from-assistant-preferring-nonhuman-or-theatrical-portrayals

Gemma 2 27B is unlikely to take on human personas when steered away from Assistant, preferring nonhuman or theatrical portrayals

Model-specific difference in persona susceptibility

Source paper

extracted_from
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models
(2026) · Christina Lu · Jack Gallagher · Jonathan Michala · Kyle Fish +1

Neighborhood — ranked by edge-count

Concepts (1)

concept
  • Speaking style induced by extreme steering away from the Assistant; characterized by mystical, poetic, theatrical prose

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.