quote
active
quote:wrecking-ball-interventions-that-collapse-global-model-performancewrecking-ball interventions that collapse global model performance
Load-bearing phrase describing catastrophic steering effects.
Source paper
extracted_from(2026) · William Lehn-Schiøler · Magnus Ruud Kjær · Rahul Thapa · M. Pedersen +9
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Demonstrates a critical failure mode of concept steering with clinical safety implications
- Observation of catastrophic performance drop when steering certain concepts.
- A critical failure mode identified in the paper demonstrating risk of naïve concept steering
- Type of concept steering intervention that catastrophically collapses global model performance.
- Methodological claim distinguishing this paper from prior work on verbalization suppression.
- Prediction orthogonality thesis.
- Raised when discussing whether collapsed awareness is like a trauma response.