method
active
method:random-latent-ablation-controlRandom Latent Ablation Control
Control experiment ablating random latents matched for activation frequency and magnitude to test OTD specificity
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Causal intervention clamping 26 identified OTD latents to zero during steered inference to test ESR contribution
- Random latent ablation produces slight increase in ESR rate (3.8% to 4.2%), not statistically significantfinding0.778Control result confirming OTD ablation effect is specific to those latents, not a general ablation artifact
- Intervention type that sets activations to zero, used for interpretability analysis
- Control method sampling random directions from top-k PC spaces matched to emotion probe variance, to isolate emotion-specific persistence
- Technique used in VPD to enforce mechanistic faithfulness of parameter decompositions.
- Systematic sweep of 10 boost levels from threshold-3σ to threshold+3σ to characterize ESR vs. steering strength
- Clamping a feature's value to zero to measure its causal effect on model output.
- Classical techniques to interrogate regulative capacity of embryos and neural crest by tissue removal or transplantation.