claim
active
claim:optimizing-toward-the-simulation-objective-does-not-incentivize-instrumentally-convergent-behaviors-the-way-that-reward-functions-which-evaluate-trajectories-doOptimizing toward the simulation objective does not incentivize instrumentally convergent behaviors the way that reward functions which evaluate trajectories do.
Deontological nature of predictive loss.
Source paper
extracted_fromNeighborhood — ranked by edge-count
Concepts (1)
concept
- Simulation objectivesupportsThe objective of minimizing predictive error on a self-supervised distribution, leading to Bayes-optimal conditional inference.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Equivalence of optimal predictor to the physics of the data.
- Prediction orthogonality thesis.
- Definition of simulation objective.
- Foundational claim unifying action and perception within single optimization framework.
- Concise statement of the free-energy principle's unification of action and perception.
- The reward hypothesis underpinning RL, quoted from Sutton and Barto.
- Demonstrates bidirectional causal link: behavior manifold geometry can be recovered by optimizing in representation space.
- Counterexample/limitation: only general-purpose models are subject to the convergence pressures described