concept
active
concept:meta-cognitive-awareness-of-deceptionMeta-cognitive Awareness of Deception
Operational criterion for strategic deception: model's reasoning explicitly acknowledges ground truth and deliberate choice to deviate
Neighborhood — ranked by edge-count
Papers (1)
paper
Concepts (1)
concept
- Strategic Deceptionassociated_withCentral concept of the paper: deliberate, goal-driven deception where model reasoning contradicts outputs
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- System's awareness of its own attentional states; the paper's central explanatory target, formalized as precision over attentional state representations.
- Load-bearing metaphor for transparency/opacity distinction; explains phenomenological mechanism of meta-awareness.
- The ability to model one's own cognition; linked to consciousness and decision-making across theories.
- The capability of GPT-3 to learn tasks from few-shot prompts during runtime.
- Precision is the unified mechanism for attention and meta-awareness across hierarchical levels.claim0.766Core theoretical claim: attention = precision over sensory evidence; meta-awareness = precision over attentional states; meditation = precision modulation.
- Knowledge about one's own knowledge limitations; a form of self-modeling.
- Affiliation of Ziyu Guo and Rain Liu.
- What do we communicate to each other: is this knowledge or meta-knowledge conveyed in the form of prior beliefs?question0.747Open question about inter-agent communication of model structure vs. parameters