method
active
method:interchange-intervention-accuracy

Interchange Intervention Accuracy

Proportion of aligned interchange interventions with equivalent high-level and low-level effects; graded measure of causal abstraction.

Neighborhood — ranked by edge-count

Concepts (1)

concept
  • Graded notion of causal abstraction measured by IIA; when IIA is alpha < 100%, the model is alpha-on-average approximately abstract.

Methods (1)

method
  • Fundamental operation for causal abstraction analysis; forces neurons to take values from source inputs to create counterfactuals.

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.