concept
active
concept:reward-as-predictable-stimuli

reward as predictable stimuli

Reinterpretation of rewards as simply predictable (unsurprising) stimuli under the free-energy principle.

Neighborhood — ranked by edge-count

Claims (1)

claim

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • Aversive stimuli are defined as surprising, linking punishment to prediction failure.
  • Highlights circularity in RL reward hypothesis; grounds motivation for preference-based active inference.
  • Reward Hypothesisconcept0.778
    The claim in RL that any goal can be expressed as maximizing the expected cumulative sum of a scalar reward signal.
  • Method for fine-tuning LMs based on human preferences; mentioned as combining RL and LMs.
  • Reward Functionconcept0.751
    In RL, a scalar signal from the environment that defines the agent's goal; in active inference, reward is just another observation with associated preference.
  • Reward improvementconcept0.750
    The increase in reward during training, whose dynamics align with those of causal emergence in successful agents.
  • Alternative framework for agent behavior; based on reward maximization rather than free energy minimization.
  • Zaadnoordijk and Bayne's category of intentional action; sticker-removal behavior induced by the self-prior corresponds to this