claim
active
claim:two-components-are-important-to-shaping-model-character-persona-construction-and-persona-stabilization

Two components are important to shaping model character: persona construction and persona stabilization

Overarching conceptual framework the paper introduces for model safety

Source paper

extracted_from
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models
(2026) · Christina Lu · Jack Gallagher · Jonathan Michala · Kyle Fish +1

Neighborhood — ranked by edge-count

Concepts (2)

concept
  • Keeping a model anchored to its intended persona during deployment, preventing drift to harmful behaviors
  • The process of building a coherent model persona from character archetypes and traits during training

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.