method
active
method:inference-time-intervention-itiInference-Time Intervention (ITI)
Method by Li et al. 2023a that adds static vectors to model activations at inference time to steer behavior
Neighborhood — ranked by edge-count
Papers (1)
paper
Thinkers (1)
thinker
- Kenneth ListudiesAuthor of Inference-Time Intervention (ITI) paper using linear probes; cited for probe-based steering method
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Inference-Time Intervention: Eliciting Truthful Answers from a Language Model (Li et al., 2023)concept0.820Safety intervention that relies on activation modification, which ESR might undermine
- The process of inferring causes of sensory inputs, a key aspect of the free-energy minimization scheme.
- Attributing subjective experience based on observable embodied behaviours.
- Metric measuring accuracy of DNN under intervention at matching algorithm-predicted outputs on held-out test set
- Algorithmic framework for probabilistic inference in graphical models.
- Process by which organism's material states and internal dynamics realize variational inference through action
- Training technique that induces specific causal structures in neural networks by co-training with interchange interventions
- The overarching hypothesis that an I or self-like ground underlies matter and becomes visible in living things.