claim
active
claim:traditional-rl-frameworks-optimize-externally-defined-reward-functions-lacking-representational-depth-for-mental-state-reasoningTraditional RL frameworks optimize externally defined reward functions lacking representational depth for mental-state reasoning
Motivation claim positioning this paper against standard RL approaches
Source paper
extracted_from(2026) · Michael Petrowski · Milica Gašić
Neighborhood — ranked by edge-count
Books (1)
book
- Standard RL textbook cited for traditional reward function optimization
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Secondary empirical result: CE-based representational changes correlate with task success.
- A competing alignment approach that fine-tunes models based on human evaluator feedback; discussed as complementary to SOO
- Critique of competing approaches that motivates SOO as filling a gap
- Central finding: causal emergence serves as a previously undisclosed axis of neural representation reorganization in learning agents.
- Argument that RL meets the agency indicator.
- Discussion of Figure 3.
- Empirical result: CE measurements correlate with and predict learning performance in RL agents.
- Machine learning paradigm where agents learn to maximize cumulative reward through interaction.