paper
referenced-only
paper:arxiv-2504-04635

Steering off course: Reliability challenges in steering language models

External IDs

title_hash
a1feb20124fca57dbef2f9579414867a2491f882
legacy_slug
arxiv-2504-04635
Frontmatter (8 fields)
{
  "doi": null,
  "year": 2025,
  "title": "Steering off course: Reliability challenges in steering language models",
  "venue": null,
  "authors": [
    "Patrick Queiroz Da Silva",
    "Hari Sethuraman",
    "Dheeraj Rajagopal",
    "Hannaneh Hajishirzi",
    "Sachin Kumar"
  ],
  "arxiv_id": "2504.04635",
  "s2_paper_id": null,
  "ingest_status": "referenced-only"
}

Outgoing (0)

None.

Incoming (0)

None.