claim
active
claim:the-elimination-of-reward-as-a-motivator-of-behavior-with-prior-beliefs-dissolves-the-tautology-of-reinforcement-learning-rewards-reinforce-behaviors-that-secure-rewards
The elimination of reward as a motivator of behavior with prior beliefs dissolves the tautology of reinforcement learning (rewards reinforce behaviors that secure rewards).
§4 Discussion.
Source paper
extracted_from: (2021) · Noor Sajid · Philip J. Ball · Thomas Parr · Karl J. Friston
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; candidates for new edges or unrecognized duplicates.
- The circular definition in RL where rewards reinforce behaviors that secure rewards, e.g., going to a cafe because coffee is rewarding.
- §3 Discussion.
- Explanation of how knowledge (not just parameters) is shared between agents; links to pre-Cartesian consciousness
- Key insight linking individual rewards to system-level learning.
- Highlights circularity in RL reward hypothesis; grounds motivation for preference-based active inference.
- Argument that RL meets the agency indicator.
- Load-bearing summary of the main empirical finding that anchors the Causally Emergent Alignment Hypothesis.