framework
active
framework:causal-influence-diagrams
Causal Influence Diagrams
Framework informing path-specific objectives by identifying causal chains leading to risky behaviors
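To make "identifying causal chains" concrete, the sketch below enumerates every directed path from a decision node to a utility node in a toy diagram. The node names, the edge list, and the `directed_paths` helper are all hypothetical and chosen only for illustration; they are not taken from the linked paper.

```python
# Toy influence diagram as an adjacency list (parent -> children).
# All node names are hypothetical, for illustration only.
EDGES = {
    "user_state": ["decision", "user_response"],
    "decision": ["user_response", "utility"],
    "user_response": ["utility"],
    "utility": [],
}


def directed_paths(graph, source, target, prefix=None):
    """Return every directed path from source to target in an acyclic graph."""
    prefix = (prefix or []) + [source]
    if source == target:
        return [prefix]
    paths = []
    for child in graph.get(source, []):
        paths.extend(directed_paths(graph, child, target, prefix))
    return paths


# Each path is one causal chain through which the decision can affect utility.
paths = directed_paths(EDGES, "decision", "utility")
for path in paths:
    print(" -> ".join(path))
```

Under this reading, a path-specific objective would reward influence along some of these chains (for example the direct edge) while excluding others (for example the chain through `user_response`).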
Neighborhood — ranked by edge-count
Papers (1)
paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Traditional mechanistic accounts (Danto, Chisholm, Goldman) that Juarrero critiques as resting on outdated Newtonian causality.
- Mechanistic interpretability technique for locating factual associations, mentioned as future work direction.
- Chvykov and Hoel's geometric extension of causal emergence to continuous systems using Fisher information.
- The ability of an agent to be a driver of subsequent events; a hallmark of cognition that causal emergence quantifies.
- A framework the paper uses alongside feature geometry to deepen mechanistic understanding of LMs.
- The use of interventions (rather than correlations) to establish a causal link between representation geometry and behavioral geometry.
- Function determining the value of a variable based on its causal parents in an acyclic causal model.
- A measure of whether a subcomponent is necessary to reproduce model behavior on a specific prompt, predicted by the causal importance network.
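
Two of the neighbors above, the function that sets a variable from its causal parents and the use of interventions rather than correlations, can be illustrated together. This is a minimal sketch assuming a hypothetical three-variable acyclic model; the variables, the functions, and the `evaluate` helper are invented for illustration.

```python
# Hypothetical acyclic model: x -> y, x -> z, y -> z.
PARENTS = {"x": [], "y": ["x"], "z": ["x", "y"]}

# One structural function per variable; its arguments are the parents' values.
FUNCTIONS = {
    "x": lambda: 1,
    "y": lambda x: 2 * x,
    "z": lambda x, y: x + y,
}

ORDER = ["x", "y", "z"]  # a topological order of the graph above


def evaluate(interventions=None):
    """Compute all variables, overriding any that are intervened on."""
    interventions = interventions or {}
    values = {}
    for var in ORDER:
        if var in interventions:
            # An intervention replaces the structural function with a constant.
            values[var] = interventions[var]
        else:
            values[var] = FUNCTIONS[var](*(values[p] for p in PARENTS[var]))
    return values


observed = evaluate()             # no intervention
intervened = evaluate({"y": 10})  # set y to 10 regardless of x
print(observed, intervened)
```

Comparing `observed["z"]` with `intervened["z"]` isolates the effect of `y` on `z`, because `x` is held at the same value in both runs.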