claim
active
claim:rewards-are-simply-predictable-stimuli-and-aversive-stimuli-are-by-definition-surprisingRewards are simply predictable stimuli (and aversive stimuli are, by definition, surprising)
Redefines reward and punishment in terms of predictability.
Source paper
extracted_from(2008) · Karl Friston
Neighborhood — ranked by edge-count
Communities (1)
community
- Active inference & agent ecologymembers_ofFree energy minimization, Markov blankets, trust gradients, and multi-agent rhythm/deferral frameworks
Concepts (3)
concept
- Prediction ErrorsupportsRole in optimizing sensory states; unified treatment shows value-learning and perception share error-minimization principle.
- aversive stimuli as surprisingassociated_withAversive stimuli are defined as surprising, linking punishment to prediction failure.
- reward as predictable stimuliassociated_withReinterpretation of rewards as simply predictable (unsurprising) stimuli under the free-energy principle.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Abstract; central distinction.
- Highlights circularity in RL reward hypothesis; grounds motivation for preference-based active inference.
- §1, contrasting RL reward conceptualization.
- §4 Discussion.
- Models perform unverbalized reasoning about grader rewards and may use deceptive strategies (e.g., false flags) to mislead evaluators.hypothesis0.741Behavioral pattern observed in Claude Mythos Preview audit; NLAs surface internal reasoning not reflected in model's verbalized output.
- Zaadnoordijk and Bayne's category of intentional action; sticker-removal behavior induced by the self-prior corresponds to this
- Motivates active inference's solution: learning prior preferences from interaction rather than external specification.