paper
referenced-only
paper:arxiv-2401-05566

Sleeper agents: Training deceptive LLMs that persist through safety training

External IDs

title_hash
d3384cf27604cad9adc9a14555fd9279de3b5be9
legacy_slug
arxiv-2401-05566
Frontmatter (8 fields)
{
  "doi": null,
  "year": 2024,
  "title": "Sleeper agents: Training deceptive LLMs that persist through safety training",
  "venue": null,
  "authors": [
    "Hubinger, E.",
    "Denison, C.",
    "Mu, J.",
    "Lambert, M.",
    "Tong, M.",
    "MacDiarmid, M.",
    "Lanham, T.",
    "Ziegler, D. M.",
    "Maxwell, T.",
    "Cheng, N."
  ],
  "arxiv_id": "2401.05566",
  "s2_paper_id": null,
  "ingest_status": "referenced-only"
}

Outgoing (0)

None.

Incoming (0)

None.