question
active
question:should-an-aversive-signal-be-operationalized-as-direct-environmental-feedback-or-as-a-latent-state-the-agent-must-inferShould an aversive signal be operationalized as direct environmental feedback or as a latent state the agent must infer?
Design question answered in the paper by choosing latent inference over direct feedback
Source paper
extracted_from(2026) · Michael Petrowski · Milica Gašić
Neighborhood — ranked by edge-count
Frameworks (1)
framework
- Introspective Exploration Componentanswered_byThe novel framework introduced in the paper: an HMM-based pain-belief signal integrated into the reward function to drive exploration
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- §1, contrasting RL reward conceptualization.
- Unpredictability is a necessary condition for genuine adaptation.
- Key prescriptive statement supporting the system-agnostic approach.
- Open question left by the wanting/liking dissociation discussion
- Concise statement of the free-energy principle's unification of action and perception.
- Levin's endorsement of the target paper's contribution.
- Formalization of perception-action cycle integrating inference and decision-making.