paper
referenced-only
paper:arxiv-2504-04635Steering off course: Reliability challenges in steering language models
External IDs
Frontmatter (8 fields)
{
"doi": null,
"year": 2025,
"title": "Steering off course: Reliability challenges in steering language models",
"venue": null,
"authors": [
"Patrick Queiroz Da Silva",
"Hari Sethuraman",
"Dheeraj Rajagopal",
"Hannaneh Hajishirzi",
"Sachin Kumar"
],
"arxiv_id": "2504.04635",
"s2_paper_id": null,
"ingest_status": "referenced-only"
}Outgoing (0)
None.
Incoming (0)
None.