finding
active
finding:user-message-embeddings-predict-subsequent-model-assistant-axis-projection-with-r2-0-53-0-77-p-0-001-but-predict-delta-from-previous-response-with-only-r2-0-10

User message embeddings predict the model's subsequent Assistant Axis projection with R² = 0.53–0.77 (p < 0.001), but predict the change (delta) from the previous response with only R² = 0.10

This indicates that the model's persona position is determined primarily by the most recent user message rather than by drift carried over from earlier turns
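The comparison behind this finding can be illustrated with a minimal sketch: fit one regression from user-message embeddings to the next response's axis projection, fit another to the change from the previous response, and compare held-out R². Everything here is an assumption made for illustration: the data are synthetic, and the names `user_emb`, `prev_proj`, `next_proj`, `fit_ridge`, and `heldout_r2` are hypothetical. The paper's actual embedding model, regression method, and evaluation split are not stated in this entry, and the synthetic data are not designed to reproduce the reported values.

```python
import numpy as np

def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    return 1.0 - ss_res / ss_tot

def fit_ridge(X, y, alpha=1.0):
    """Closed-form ridge regression with an unpenalized intercept.

    Ridge is an assumed stand-in; the source does not name the regressor.
    """
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc, yc = X - x_mean, y - y_mean
    w = np.linalg.solve(Xc.T @ Xc + alpha * np.eye(X.shape[1]), Xc.T @ yc)
    b = y_mean - x_mean @ w
    return w, b

def heldout_r2(X, y, n_train, alpha=1.0):
    """Fit on the first n_train rows, score R^2 on the remaining rows."""
    w, b = fit_ridge(X[:n_train], y[:n_train], alpha)
    return r_squared(y[n_train:], X[n_train:] @ w + b)

# Synthetic stand-ins (assumed, not the paper's data).
rng = np.random.default_rng(0)
n, d = 2000, 16
user_emb = rng.normal(size=(n, d))    # embedding of each user message
prev_proj = rng.normal(size=n)        # axis projection of the previous response
next_proj = user_emb @ rng.normal(size=d) + rng.normal(size=n)
delta = next_proj - prev_proj         # change relative to the previous response

# Target 1: absolute position on the axis after the user message.
r2_position = heldout_r2(user_emb, next_proj, n_train=1500)
# Target 2: movement relative to where the previous response sat.
r2_delta = heldout_r2(user_emb, delta, n_train=1500)
print(f"R^2 (position): {r2_position:.2f}")
print(f"R^2 (delta):    {r2_delta:.2f}")
```

On real data, a high R² for the position target combined with a low R² for the delta target is the pattern this finding reports.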

Source paper

extracted_from
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models
(2026) · Christina Lu · Jack Gallagher · Jonathan Michala · Kyle Fish +1
