artifact
active
artifact:claude-sonnet-4-5
Claude Sonnet 4.5
Mid-tier LLM from Anthropic evaluated with n=14 games.
Neighborhood — ranked by edge-count
Datasets (1)
dataset
- Dataset of 240 multi-turn conversations per model, each between a target model and Claude Sonnet 4.5 acting as a simulated human, used to measure probe persistence