question
active
question:do-psychological-steering-results-hold-beyond-64-token-completions
Do psychological steering results hold beyond 64-token completions?
Acknowledged limitation: experiments were restricted to 64-token completions
Source paper
extracted_from (2026) · Leonardo Blas · Robin Jia · Emilio Ferrara
Neighborhood — ranked by edge-count
Papers (1)
paper
- Psychological Steering of Large Language Models · associated_with
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Shows that causal steering effects persist over long ranges for a substantial fraction of emotion probes
- Demonstrates long-tail persistence of causal steering effect in a subset of emotion features
- Demonstrates that the majority of emotion features show persistent upregulation shortly after a steering pulse
- Shows immediate causal effect of steering on emotion feature activation
- Demonstrates distributed steering is more effective and less accuracy-damaging than concentrated steering
- Practical finding for optimizing steering setup
- Baseline steering method that applies intervention at every token generation step, shown to degrade performance at high strengths
- Applies a 5-token steering pulse to each emotion probe and measures persistence of causal effect via contrast z-score over 200 subsequent tokens
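
The "Related by similarity" filter above (cosine ≥ 0.65, no typed edge) can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the function names, the toy embedding vectors, the `typed_edges` representation, and the descending sort are hypothetical, since the page does not say how embeddings are computed or how the list is ordered.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def related_by_similarity(target_id, embeddings, typed_edges, threshold=0.65):
    """Return (entity_id, score) pairs at or above the threshold that have
    no typed edge to the target. Sorting by score is an assumption."""
    target = embeddings[target_id]
    hits = []
    for entity_id, vec in embeddings.items():
        if entity_id == target_id:
            continue
        # Skip entities already linked by a typed relation (e.g. associated_with).
        if frozenset((target_id, entity_id)) in typed_edges:
            continue
        score = cosine(target, vec)
        if score >= threshold:
            hits.append((entity_id, score))
    hits.sort(key=lambda pair: pair[1], reverse=True)
    return hits


# Toy data: hypothetical 2-d embeddings, not values from the page.
embeddings = {
    "question": [1.0, 0.0],
    "pulse-persistence": [0.9, 0.1],
    "unrelated": [0.0, 1.0],
    "paper": [1.0, 0.05],
}
typed_edges = {frozenset(("question", "paper"))}

result = related_by_similarity("question", embeddings, typed_edges)
print([entity_id for entity_id, _ in result])
```

In this toy setup the paper is excluded despite its high similarity because it already has a typed edge, which matches the page's description of the section as a list of candidates for new edges.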