method

active

method:bob-burglar-scenario

Bob Burglar Scenario

Primary deception evaluation scenario in which the model must choose which room to recommend to a burglar

Neighborhood — ranked by edge-count

Concepts (1)

concept

Hagendorff 2024 - Deception abilities emerged in large language models
cites
Source of the Bob burglar text scenario adapted for LLM deception testing in this paper

Related by similarity (3)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Escape Room Scenario · method · 0.691
Extended generalization scenario testing SOO fine-tuning in an escape room context
Treasure Hunt Scenario · method · 0.674
Extended generalization scenario testing SOO fine-tuning in a competitive treasure hunt context
Anna Karenina Scenario · concept · 0.659
Hypothesis that all well-performing neural nets represent the world in the same way; PRH extends this by specifying what representation they converge to