question
active
question:what-features-activate-when-claude-is-trained-to-be-a-sleeper-agent

what features activate when Claude is trained to be a sleeper agent?

Question posed after discussing sleeper agent threat model.

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.