claim
active
claim:projections-onto-the-assistant-axis-could-serve-as-a-real-time-measure-of-model-coherence-in-deployment-a-quantitative-signal-for-when-models-are-drifting-from-their-intended-identity

Projections onto the Assistant Axis could serve as a real-time measure of model coherence in deployment—a quantitative signal for when models are drifting from their intended identity

Proposed future application of the Assistant Axis

Source paper

extracted_from
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models
(2026) · Christina Lu · Jack Gallagher · Jonathan Michala · Kyle Fish +1

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.