concept
active
concept:claude-4-opus

Claude 4 Opus

Anthropic model; outlier in Experiment 1 with high baseline affirmation including under zero-shot and history conditions

Neighborhood — ranked by edge-count

Concepts (1)

concept
  • Claude 3 Opus
    related_to
    Primary model studied; production LLM that exhibits alignment faking in experiments

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.