concept
active
concept:intervenable-modelIntervenable Model
pyvene class that decorates a torch model with hooks allowing activations to be collected and overwritten
Neighborhood — ranked by edge-count
Papers (1)
paper
Concepts (2)
concept
- Dict-based configuration format in pyvene that outlines which model components will be intervened upon
- Getter and Setter HooksimplementsTwo types of hooks implemented by IntervenableModel to save and set activations during forward passes
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- A representation that captures relevant aspects of a system; according to the theorem, the regulator must embody this.
- Formal representation of algorithms as directed acyclic graphs computing functions f_A
- Primary test domain for manifold steering, including reasoning and ICL tasks
- Probability of data under the model, penalizing complexity and rewarding accuracy.
- Comparing models using log-evidence approximated by free energy.
- Class of large language models designed to produce extended chain-of-thought before answering, studied in this paper