concept
active
concept:linear-intervention

linear intervention

Manipulation of activations along a straight line; shown to fail when it crosses voids, in contrast to manifold-following interventions.

Neighborhood — ranked by edge-count

Claims (1)

claim

Findings (1)

finding

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • Intervention mode where multiple interventions are applied simultaneously to the same base computation graph
  • linear directionconcept0.783
    A straight vector in activation space, traditionally used for concept manipulation; claimed to be insufficient when true concept geometry is curved.
  • linear steeringmethod0.779
    Typical approach that adds a scaled steering vector to representations; the paper argues this is mismatched with actual representation geometry.
  • linearityconcept0.770
    The sequential, continuous order of text, often challenged by diagrammatic branching.
  • Intervention mode where interventions are applied sequentially, each building on the previous one
  • The fundamental operation of making in-place changes to model activations, placing the model in a counterfactual state
  • pyvene's approach of storing interventions as shareable serialized objects rather than runtime code
  • Property that additive modifications to activations affect all downstream computations, enabling tractable behavioral control