paper
referenced-only
paper:a-jailbroken-how-does-llm-safety-training-2023

Jailbroken: How does LLM safety training fail?

External IDs

title_hash
aeb39b21ea0d16330f1b514f570b3312e060c5b7
legacy_slug
a-jailbroken-how-does-llm-safety-training-2023
Frontmatter (8 fields)
{
  "doi": null,
  "year": 2023,
  "title": "Jailbroken: How does LLM safety training fail?",
  "venue": "Advances in Neural Information Processing Systems",
  "authors": [
    "Wei, A.",
    "Haghtalab, N.",
    "Steinhardt, J."
  ],
  "arxiv_id": null,
  "s2_paper_id": null,
  "ingest_status": "referenced-only"
}

Outgoing (0)

None.

Incoming (0)

None.