method
active
method:synthetic-situational-judgment-test-batterySynthetic Situational Judgment Test Battery
Open-ended situational judgment tests synthesized using GPT-5.1 from ATOMIC10x heads and inventory items; primary evaluation instrument for open-ended steering
Neighborhood — ranked by edge-count
Papers (1)
paper
- Psychological Steering of Large Language Modelsintroducesuses
Thinkers (1)
thinker
- Seungbeen LeeextendsintroducesAuthor of TRAIT testbench (8,000 SJTs for OCEAN and Dark Triad); methods adapted in SJT generation
Findings (1)
finding
- Highest SJT alignment among all validation comparisons
Concepts (1)
concept
- Aggregate metric averaging mean SJT scores across OCEAN traits and steering directions; maximum possible is 10
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Method of constructing controlled synthetic stimuli to test neuron response properties
- Testing five phrasings of the self-referential prompt to confirm robustness to wording variation
- Mental states that guide behaviour via assessments of what is good, right, or rational.
- Fine-tuning on Claude-generated self-correction examples with loss masking to induce ESR-like behavior
- The specific form of reflection studied, where a model reflects on reasoning generated by another source.
- A test of intelligence via linguistic performance; deemed insufficient for sentience assessment by Levin.
- Methodological justification for using SDF over direct demonstrations to train a realistic model organism.
- Shows alignment faking can emerge from training data information without explicit prompting