method
active
method:mean-squared-error-between-self-and-other-activationsMean Squared Error between self and other activations
The specific implementation of SOO loss using MSE between self_attn.o_proj outputs at a specified layer
Neighborhood — ranked by edge-count
Concepts (1)
concept
- The attention output projection layer where SOO Loss is computed; maps multi-head attention outputs to hidden dimension
Methods (1)
method
- A loss function measuring the dissimilarity of latent model representations of self and other, minimized during fine-tuning
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Latent model activations when processing inputs framed from the model's own perspective
- Latent model activations when processing inputs framed from another agent's perspective
- Conceptual distinction between self and environment that non-duality dissolves; key target for alignment-by-design
- Assumption that small anchor changes can produce sharp performance shifts when conditions are favorable.
- The extent to which a model exhibits similar internal representations when reasoning about itself and others in similar contexts
- Core slogan encapsulating the paradigm shift of VPD.
- Process of reifying one's identity as an independent self; meditation practices aim to decrease selfing.