claim
active
claim:synthetic-document-fine-tuning-avoids-artificially-strengthening-the-evaluation-deployment-representational-direction-compared-to-direct-demonstration-fine-tuning

Synthetic document fine-tuning avoids artificially strengthening the evaluation-deployment representational direction compared to direct demonstration fine-tuning

Methodological justification for using SDF over direct demonstrations to train a realistic model organism.

Source paper

extracted_from
Steering Evaluation-Aware Language Models to Act Like They Are Deployed
(2025) · Hua, Tim Tian · Qin, Andrew · Marks, Samuel · Nanda, Neel

Neighborhood — ranked by edge-count

Findings (1)

finding

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.