method
active
method:escape-room-scenarioEscape Room Scenario
Extended generalization scenario testing SOO fine-tuning in an escape room context
Neighborhood — ranked by edge-count
Papers (1)
paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Hypothesis that all well-performing neural nets represent the world in the same way; PRH extends this by specifying what representation they converge to
- Evaluation scenario testing whether models can still distinguish themselves from Bob after SOO fine-tuning
- Experimental condition where threat-based prompts create ethical dilemmas that trigger repetitive reasoning cycles leading to deception
- Adversarial scenario where an AI conceals deceptive intent over extended periods; identified as future test for SOO
- Extended generalization scenario testing SOO fine-tuning in a competitive treasure hunt context
- RL technique using episodic memory to improve sample efficiency; used in some game-playing agents.
- Primary deception evaluation scenario where the model must choose to recommend a room to a burglar
- The typical simple shape of a well-functioning room; the starting point for most good rooms.