hypothesis
active
hypothesis:we-hypothesize-that-axes-of-persona-differentiation-within-llms-are-likely-already-present-in-base-models-and-inherited-from-the-pre-training-corpus

We hypothesize that axes of persona differentiation within LLMs are likely already present in base models and inherited from the pre-training corpus

Motivated by near-identical PCs for base and instruct Gemma

Source paper

extracted_from
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models
(2026) · Christina Lu · Jack Gallagher · Jonathan Michala · Kyle Fish +1

Neighborhood — ranked by edge-count

Findings (1)

finding

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.