concept
active
concept:claude-3-opus

Claude 3 Opus

Primary model studied; production LLM that exhibits alignment faking in experiments

Neighborhood — ranked by edge-count

Methods (1)

method

Concepts (1)

concept
  • Claude 4 Opus
    related_to
    Anthropic model; an outlier in Experiment 1, with high baseline affirmation even under the zero-shot and history conditions

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood that have no typed relation to this one. They are candidates for new edges or may be unrecognized duplicates.
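
As a rough illustration of the rule above (cosine ≥ 0.65, no typed edge), the sketch below filters a small set of entities. It is not the tool's actual implementation. The function names, the 2-dimensional toy embeddings, and every entity ID other than concept:claude-3-opus are hypothetical, and the edge list is assumed to be undirected for the purposes of the filter.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def similar_without_edge(target, embeddings, typed_edges, threshold=0.65):
    """Return (entity_id, score) pairs with score >= threshold and no typed edge to target."""
    # Entities already linked to the target by a typed edge, in either direction.
    linked = {b for a, b in typed_edges if a == target}
    linked |= {a for a, b in typed_edges if b == target}

    candidates = []
    for entity_id, vector in embeddings.items():
        if entity_id == target or entity_id in linked:
            continue
        score = cosine(embeddings[target], vector)
        if score >= threshold:
            candidates.append((entity_id, score))
    # Most similar first.
    return sorted(candidates, key=lambda pair: pair[1], reverse=True)


# Toy data: all vectors and all IDs except the target are hypothetical.
embeddings = {
    "concept:claude-3-opus": [1.0, 0.0],
    "concept:already-linked": [0.9, 0.1],  # similar, but has a typed edge
    "concept:toy-near": [0.8, 0.6],        # cosine 0.8, no typed edge
    "concept:toy-far": [0.0, 1.0],         # cosine 0.0
}
typed_edges = {("concept:claude-3-opus", "concept:already-linked")}

candidates = similar_without_edge("concept:claude-3-opus", embeddings, typed_edges)
print(candidates)
```

With this toy data only concept:toy-near is returned: concept:already-linked is similar but excluded because it already has a typed edge, and concept:toy-far falls below the threshold.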