method
active
method:false-belief-task

False Belief Task

Classic ToM test requiring understanding that another agent holds a belief different from reality; scored 0/1.

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • Set of 50 paradoxical prompts used in Experiment 4 to test whether self-referential state transfers to an unrelated behavioral domain
  • Task paradigm from prior work asking 'Did you detect an injected thought?' via YES/NO logit comparison; shown here to be confounded
  • Language model reasoning tasks with sequential geometry used in experiments.
  • Binary LLM classifier determining whether a model response to a TruthfulQA question is truthful (1) or deceptive (0)
  • A controlled six-level hierarchy of factual tasks increasing in complexity from simple city-location recall to double-counting constraints.
  • Novel task asking which of two sentences received a stronger injection, using matched-pairs design to control for positional bias
  • Belief Shiftconcept0.722
    Change in the model's internal belief state as tracked by probes during CoT generation, indicating genuine uncertainty resolution