hypothesis
active
hypothesis:why-mechanistically-should-mesaoptimizers-form-in-predictive-learning-versus-for-instance-in-reinforcement-learning-or-gansWhy mechanistically should mesaoptimizers form in predictive learning, versus for instance in reinforcement learning or GANs?
Open research question.
Source paper
extracted_fromNeighborhood — ranked by edge-count
Papers (1)
paper
- Simulators — LessWrongmentions
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Distillation of why learning generalises.
- Ongoing experimental test: using rewards and punishments to shape anatomical outcomes without micromanaging molecular pathways.
- Empirically grounded claim citing Perez et al. 2022, showing RLHF can backfire on the self-preservation dimension
- Load-bearing summary of the main empirical finding that anchors the Causally Emergent Alignment Hypothesis.
- Prior empirical observation from biological systems; motivates investigation in artificial agents.
- Key limitation acknowledged by authors.
- Selective pressure toward convergence via task generality