concept
active
concept:wrecking-ball-intervention

wrecking-ball intervention

Type of concept steering intervention that catastrophically collapses global model performance.

Neighborhood — ranked by edge-count

Methods (1)

method
  • Concept Steering
    associated_with
    Latent intervention technique that manipulates sparse features to steer model predictions toward desired concepts.

Concepts (1)

concept
  • A failure mode exposed by the SAE framework where model representations are entangled or collapse under intervention

Events (1)

event

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.