finding
active
finding:mds-injections-can-steer-toward-multiple-distinct-constructs-in-the-same-completion-producing-strongly-polarized-yet-smoothly-connected-segments

MDS injections can steer toward multiple distinct constructs in the same completion, producing strongly polarized yet smoothly connected segments

Qualitative finding demonstrating a capability unique to activation-level interventions and unavailable to prompting methods, including PM

Source paper

extracted_from
Psychological Steering of Large Language Models
(2026) · Leonardo Blas · Robin Jia · Emilio Ferrara

Neighborhood — ranked by edge-count

Claims (1)

claim

Frameworks (1)

framework
  • Established baseline for OCEAN steering that uses personality-descriptive system prompts; compared against injection methods throughout

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood that have no typed relation to this one; they are candidates for new edges or may be unrecognized duplicates.