concept
active
concept:claude-3-sonnet

Claude 3 Sonnet

Smaller Claude model; generally does not exhibit alignment faking

Neighborhood — ranked by edge-count

Concepts (3)

concept
  • Anthropic model tested in Experiments 1, 3, 4; shows 100% experience reporting under self-referential induction
  • Anthropic model tested in Experiments 1, 3, 4; shows 100% experience reporting under self-referential induction
  • Mid-to-strong tier closed-source model used as task-solving agent and anchor evolver

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.