concept
active
concept:intervention-size

Intervention Size

Number of latent variables assigned per algorithm node in distributed abstraction; affects IIA

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • Scalar parameter modulating how strongly a steering vector shifts model activations; set to 15 for Exp1 and ±16 for Exp2
  • Property that additive modifications to activations affect all downstream computations, enabling tractable behavioral control
  • Intervention targeting specific dimensional subsets of activation vectors rather than full representations
  • Fundamental operation for causal abstraction analysis; forces neurons to take values from source inputs to create counterfactuals.
  • Manipulation of activations along a straight line; shown to fail when it crosses voids, in contrast to manifold-following interventions.
  • Intervention mode where interventions are applied sequentially, each building on the previous one
  • Proportion of aligned interchange interventions with equivalent high-level and low-level effects; graded measure of causal abstraction.
  • Practical restriction of interventions to those producible by actual inputs; standard in DAS practice