concept
active
concept:awareness-of-propensities

Awareness of propensities

Ability of a model to describe its own learned behavioral tendencies.

Neighborhood — ranked by edge-count

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • A dialogue agent using first-personal pronouns and expressing self-concern in ways that suggest consciousness but are actually role play
  • Model's access to information about its training objective, deployment context, and ability to distinguish training from non-training
  • Self Awarenessconcept0.780
  • Meta-Awarenessconcept0.768
    System's awareness of its own attentional states; the paper's central explanatory target, formalized as precision over attentional state representations.
  • Intensificationconcept0.765
    The act of strengthening an existing or latent center, making it more pronounced and more life-giving, which is the core mechanism of structure-preserving transformations.
  • The central concept: the ability of a model to access and report on its internal states, as defined by the paper's criteria.
  • When the model explicitly mentions being tested in its chain-of-thought reasoning; distinguished from behavioral evaluation awareness.