concept
active
concept:causal-decouplingCausal Decoupling
Emergent causation where macro-variable has causal influence on its own future independently of micro-states.
Neighborhood — ranked by edge-count
Concepts (2)
concept
- Causal Emergenceassociated_withCore concept: degree to which an agent exerts unique predictive power on its future; key to cognition at all scales.
- Synergistic Informationassociated_withInformation-theoretic measure (Edlund et al.) characterizing interdependencies in agent-environment systems.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Whether an internal direction causally controls a target behavior, verified by intervention success
- Method by Chan et al. 2022 for rigorously testing interpretability hypotheses via interventions
- Formal definition: H is a constructive abstraction of L under alignment Π when interchange interventions have equivalent effects at both levels.
- Operational definition of introspection: self-report covaries monotonically with probe-defined direction AND causally shifting activations shifts the report in a semantically coherent way
- Function determining the value of a variable based on its causal parents in an acyclic causal model.
- Property that causal mechanisms remain stable across environments; desirable for OOD.
- A framework the paper uses alongside feature geometry to deepen mechanistic understanding of LMs
- Confound where naming injected concepts reflects direct logit effects rather than metacognitive awareness, raised by Morris & Plunkett