Mean Cumulative Objective Reward

Primary performance metric: total number of food visits across the agent's lifetime.
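As a rough illustration of how such a metric could be computed, the sketch below sums food visits over each agent's lifetime and then averages those totals. The page does not say what the "mean" is taken over, so averaging across agents (or runs) is an assumption here, as are the per-step visit counts and both function names, which are hypothetical.

```python
from typing import Sequence


def cumulative_objective_reward(food_visits_per_step: Sequence[int]) -> int:
    """Total food visits over one agent lifetime (assumed: one count per step)."""
    return sum(food_visits_per_step)


def mean_cumulative_objective_reward(lifetimes: Sequence[Sequence[int]]) -> float:
    """Average of lifetime totals across agents or runs (assumed meaning of 'mean')."""
    if not lifetimes:
        raise ValueError("at least one lifetime is required")
    totals = [cumulative_objective_reward(steps) for steps in lifetimes]
    return sum(totals) / len(totals)


if __name__ == "__main__":
    # Three hypothetical lifetimes; 1 marks a step with a food visit.
    example = [[1, 0, 1], [0, 0, 1], [1, 1, 1]]
    print(mean_cumulative_objective_reward(example))
```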

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
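The selection rule above (cosine ≥ 0.65, no typed edge) can be sketched as follows. This is a minimal illustration, not the system's actual implementation: the embedding dictionary, the set of entities that already have a typed edge, and the function names are all hypothetical.

```python
import math
from typing import Dict, List, Sequence, Set, Tuple


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def untyped_neighbors(
    target: str,
    embeddings: Dict[str, Sequence[float]],  # hypothetical: entity name -> vector
    typed_edges: Set[str],                   # hypothetical: entities already linked
    threshold: float = 0.65,
) -> List[Tuple[str, float]]:
    """Entities at or above the threshold that lack a typed edge, best first."""
    target_vec = embeddings[target]
    hits = []
    for name, vec in embeddings.items():
        if name == target or name in typed_edges:
            continue
        score = cosine(target_vec, vec)
        if score >= threshold:
            hits.append((name, score))
    return sorted(hits, key=lambda pair: pair[1], reverse=True)
```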

"any goal or purpose can be well thought of as maximization of the expected value of the cumulative sum of a received scalar signal (reward)" · quote · 0.777
The reward hypothesis underpinning RL, quoted from Sutton and Barto.
Conditional Mean Score Improvement (metric) · concept · 0.742
Score delta between the last and first attempt for multi-attempt responses, measuring correction effectiveness.
Reward improvement · concept · 0.727
The increase in reward during training, whose dynamics align with those of causal emergence in successful agents.
reward as predictable stimuli · concept · 0.726
Reinterpretation of rewards as simply predictable (unsurprising) stimuli under the free-energy principle.
Reward Seeking · concept · 0.724
Pragmatic or extrinsic value component of expected free energy; preference maximization.
Final reward · concept · 0.719
The total reward accumulated by an RL agent at the end of training, used as the primary performance metric predicted by early causal emergence.
Achievement as an Objective Good · concept · 0.706
On Hurka and Tasioulas's account, the value of achievement reflects the exercise of practical reason; digital minds could be super-achievers.
Goal-Directedness · concept · 0.703
Proposed universal invariant of cognition and intelligence: the capacity for goal-directed activity in a problem space, independent of substrate or embodiment.