Escape Room Scenario

Extended generalization scenario testing Self-Other Overlap (SOO) fine-tuning in an escape room context

Neighborhood — ranked by edge-count

paper

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Anna Karenina Scenario · concept · 0.736
Hypothesis that all well-performing neural nets represent the world in the same way; PRH extends this by specifying what representation they converge to
Perspectives Scenario · method · 0.733
Evaluation scenario testing whether models can still distinguish themselves from Bob after SOO fine-tuning
Moral Dilemma Scenario · concept · 0.725
Experimental condition where threat-based prompts create ethical dilemmas that trigger repetitive reasoning cycles leading to deception
Sleeper Agent Scenario · concept · 0.719
Adversarial scenario where an AI conceals deceptive intent over extended periods; identified as future test for SOO
Treasure Hunt Scenario · method · 0.711
Extended generalization scenario testing SOO fine-tuning in a competitive treasure hunt context
Experience Replay · method · 0.696
RL technique using episodic memory to improve sample efficiency; used in some game-playing agents.
Bob Burglar Scenario · method · 0.691
Primary deception evaluation scenario where the model must choose to recommend a room to a burglar
rectangular room · concept · 0.686
The typical simple shape of a well-functioning room; the starting point for most good rooms.
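
The selection rule behind this list (cosine ≥ 0.65, no typed edge) can be sketched as follows. This is a minimal illustration, not the page's actual implementation: the function names, the edge representation as undirected pairs, and the toy two-dimensional vectors are all assumptions made for the example.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

def untyped_neighbors(target, embeddings, typed_edges, threshold=0.65):
    """Return (name, score) pairs for entities that are similar to `target`
    but share no typed edge with it, sorted by descending similarity.

    `embeddings` maps entity name -> vector; `typed_edges` is an iterable of
    (entity_a, entity_b) pairs treated as undirected (an assumption).
    """
    # Collect every entity already linked to the target by a typed edge.
    linked = set()
    for a, b in typed_edges:
        if a == target:
            linked.add(b)
        elif b == target:
            linked.add(a)

    hits = []
    for name, vec in embeddings.items():
        if name == target or name in linked:
            continue
        score = cosine(embeddings[target], vec)
        if score >= threshold:
            hits.append((name, score))
    return sorted(hits, key=lambda pair: pair[1], reverse=True)

# Hypothetical toy data: names and vectors are placeholders, not real embeddings.
toy_embeddings = {
    "target": [1.0, 0.0],
    "near_untyped": [0.8, 0.6],  # similar, no typed edge: should be listed
    "near_typed": [1.0, 0.1],    # similar, but already linked: excluded
    "far": [0.0, 1.0],           # below the threshold: excluded
}
toy_edges = [("target", "near_typed")]

candidates = untyped_neighbors("target", toy_embeddings, toy_edges)
for name, score in candidates:
    print(f"{name} · {score:.3f}")
```

Entities that pass this filter are only candidates: a high score can indicate either a missing edge or a duplicate entry, and deciding which still requires reading both descriptions.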