claim
active
claim:reinforcement-learning-can-be-regarded-as-a-limiting-or-special-case-of-model-based-approaches-in-general-or-active-inference-in-particular-when-epistemic-value-is-removedReinforcement learning can be regarded as a limiting or special case of model-based approaches in general — or active inference in particular — when epistemic value is removed.
§3 Discussion.
Source paper
extracted_from(2021) · Noor Sajid · Philip J. Ball · Thomas Parr · Karl J. Friston
Neighborhood — ranked by edge-count
Hypotheses (1)
hypothesis
- Stated as conditional statement explaining the special case whence RL emerges.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Core question addressed by the simulations when rewards are removed.
- Key insight linking individual rewards to system-level learning.
- Empirically grounded claim citing Perez et al. 2022, showing RLHF can backfire on the self-preservation dimension
- §3, reward shaping conclusion.
- §4 Discussion.
- Abstract; central distinction.
- Operational definition of RL used throughout the paper, quoted from Sutton.
- Argument that RL meets the agency indicator.