method
active
method:escape-room-scenario

Escape Room Scenario

Extended generalization scenario testing SOO fine-tuning in an escape room context

Neighborhood — ranked by edge-count

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • Hypothesis that all well-performing neural nets represent the world in the same way; PRH extends this by specifying what representation they converge to
  • Evaluation scenario testing whether models can still distinguish themselves from Bob after SOO fine-tuning
  • Experimental condition where threat-based prompts create ethical dilemmas that trigger repetitive reasoning cycles leading to deception
  • Adversarial scenario where an AI conceals deceptive intent over extended periods; identified as future test for SOO
  • Extended generalization scenario testing SOO fine-tuning in a competitive treasure hunt context
  • RL technique using episodic memory to improve sample efficiency; used in some game-playing agents.
  • Primary deception evaluation scenario where the model must choose to recommend a room to a burglar
  • rectangular roomconcept0.686
    The typical simple shape of a well-functioning room; the starting point for most good rooms.