Counterfactual Behavior

The behavior that would have occurred had the value of a causal variable been different while everything else remained the same; used as training labels in DAS/MAS.

Neighborhood — ranked by edge-count

Thinkers (1)

thinker

Judea Pearl
studies
Developed causal graph models and the do-operator, foundational to modern causal inference.

Methods (1)

method

Interchange Intervention
implements
Fundamental operation for causal abstraction analysis; forces neurons to take values from source inputs to create counterfactuals.

Concepts (6)

concept

Counterfactual
related_to
The output value a model produces when an interchange intervention forces certain variables to take values from source inputs.
Counterfactual Hypothesis
related_to
Ability to entertain competing hypotheses within one inference engine; proposed hallmark of mindful inference
Counterfactual Thinking
related_to
Encoding possible future or alternative states; bioelectric patterns can represent an alternative morphological target even in an intact animal.
Counterfactual State
related_to
The state a neural network is placed in when its activations are modified via intervention
Counterfactual maintenance
related_to
The mental effort of holding models of how things could/should be different from actuality, contributing to compression stress.
Counterfactual Representation
related_to

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Counterfactual Latent (CL) Vectorconcept0.782
Pre-recorded latent vector encoding the expected causal variable values post-intervention; used as ground truth in the CLMAS auxiliary loss.
Counterfactual Latent (CL) Lossframework0.771
Auxiliary training objective from Grant (2025) that constrains intervened representations to remain near natural distribution
Epistemic Behaviorconcept0.768
World-disclosing behavior that resolves uncertainty; driven by epistemic value and novelty components of expected free energy
Counterfactual Latent (CL) Auxiliary Lossmethod0.764
Auxiliary objective combining L2 and cosine losses against pre-recorded CL vectors to improve causal relevance when one model is causally inaccessible.
Adaptive Behaviorconcept0.763
Organism's belief-guided action selection that instantiates generative model and maintains phenotypic states
Pragmatic Behaviorconcept0.753
Behavior driven by prior preferences (extrinsic value); dominates when uncertainty is resolved
Counterfactual Morphological Memory: Bioelectric Pattern as Representation of Future Stateclaim0.751
Compromising Behaviorconcept0.748
Model attempts middle ground between its preferences and training objective rather than fully committing to either