method
active
method:false-belief-taskFalse Belief Task
Classic ToM test requiring understanding that another agent holds a belief different from reality; scored 0/1.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Set of 50 paradoxical prompts used in Experiment 4 to test whether self-referential state transfers to an unrelated behavioral domain
- Task paradigm from prior work asking 'Did you detect an injected thought?' via YES/NO logit comparison; shown here to be confounded
- Language model reasoning tasks with sequential geometry used in experiments.
- Binary LLM classifier determining whether a model response to a TruthfulQA question is truthful (1) or deceptive (0)
- A controlled six-level hierarchy of factual tasks increasing in complexity from simple city-location recall to double-counting constraints.
- Novel task asking which of two sentences received a stronger injection, using matched-pairs design to control for positional bias
- Change in the model's internal belief state as tracked by probes during CoT generation, indicating genuine uncertainty resolution