framework
active
framework:self-other-modeling-somSelf-Other Modeling (SOM)
Related technique improving multi-agent learning by predicting others' actions using an agent's own policy
Neighborhood — ranked by edge-count
Papers (1)
paper
Frameworks (1)
framework
- The central framework proposed in this paper: aligning AI internal representations of self and others to reduce deceptive behavior
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Ability of a model to predict its own outputs or behavior, sometimes distinguished from introspection.
- Conceptual distinction between self and environment that non-duality dissolves; key target for alignment-by-design
- Robots capable of building internal models of their own body and unexpected changes, blurring the embodied/non-embodied AI distinction
- The transformer's model of itself as a predictive text engine, developed through in-context learning.
- Provides the transparency/opacity distinction used to characterise the separation prior sigma
- Formal definition of the paper's central construct