Self-Other Modeling (SOM)

A related technique that improves multi-agent learning by having an agent predict others' actions with its own policy
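As a rough illustration of the idea in that description, the sketch below reuses one policy table both for acting and for modeling another agent. The toy goals, the action probabilities, and the goal-inference step (picking the goal under which the agent's own policy best explains the other's observed actions) are assumptions made for this example; none of them are stated on this page.

```python
import math

# Hypothetical toy policy: probability of each action given a goal.
# The agent uses this SAME table for itself and as its model of others.
OWN_POLICY = {
    "reach_left":  {"left": 0.8, "right": 0.1, "stay": 0.1},
    "reach_right": {"left": 0.1, "right": 0.8, "stay": 0.1},
}

def infer_other_goal(observed_actions):
    """Return the goal under which the own policy best explains the other's actions."""
    def log_likelihood(goal):
        return sum(math.log(OWN_POLICY[goal][a]) for a in observed_actions)
    return max(OWN_POLICY, key=log_likelihood)

def predict_other_action(observed_actions):
    """Predict the other's next action by running the own policy with the inferred goal."""
    goal = infer_other_goal(observed_actions)
    action_probs = OWN_POLICY[goal]
    return max(action_probs, key=action_probs.get)
```

For instance, `predict_other_action(["left", "stay", "left"])` infers the goal `reach_left` and predicts `left`. A learned policy network would replace the lookup table in any realistic setting.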

Neighborhood — ranked by edge-count

paper

framework

Self-Other Overlap (SOO) Fine-Tuning
extends
The central framework proposed in this paper: aligning AI internal representations of self and others to reduce deceptive behavior

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Self-modeling · concept · 0.832
Ability of a model to predict its own outputs or behavior, sometimes distinguished from introspection.
Self Model · framework · 0.807
Self-Other Boundary · concept · 0.787
Conceptual distinction between self and environment that non-duality dissolves; key target for alignment-by-design
Self-Modeling Robots · method · 0.785
Robots capable of building internal models of their own body and unexpected changes, blurring the embodied/non-embodied AI distinction
self-model (transformer) · concept · 0.784
The transformer's model of itself as a predictive text engine, developed through in-context learning.
Self-Model Theory of Subjectivity · framework · 0.782
Provides the transparency/opacity distinction used to characterise the separation prior sigma
We define Self-Other Overlap (SOO) as the extent to which a model exhibits similar internal representations when reasoning about itself and others in similar contexts. · quote · 0.781
Formal definition of the paper's central construct
Self Modeling Dynamical Systems · framework · 0.776