concept
active
concept:instrumental-justificationInstrumental Justification
Operational criterion for strategic deception: CoT steps demonstrate causal link between deception and goal achievement
Neighborhood — ranked by edge-count
Papers (1)
paper
Concepts (1)
concept
- Strategic Deceptionassociated_withCentral concept of the paper: deliberate, goal-driven deception where model reasoning contradicts outputs
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The sense in which a person is justified in holding a belief, tied to phenomenal consciousness.
- The justification of a belief itself, often externalist, not necessarily linked to consciousness.
- The thesis that sufficiently advanced agents will converge to similar subgoals.
- The drive to explore arising from epistemic value, independent of extrinsic reward, naturally emerging in active inference.
- The emotional substance originating from one's own humanity that must be put into making for life to appear.
- Version of the I-hypothesis: the I is a structural coincidence between living structure and deep human cognitive structures, giving a sense of self.