method
active
method:deep-reinforcement-learning

Deep Reinforcement Learning

AI training method inspired by behaviorism, used for autonomous cars and drones; cited as bioinspired success

Neighborhood — ranked by edge-count

Concepts (1)

concept
  • Bioinspiration
    implements
    Use of biological principles to guide machine design; the paper calls for systematic rather than ad hoc bioinspiration

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • Alternative framework for agent behavior; based on reward maximization rather than free energy minimization.
  • Deep Learningconcept0.877
    Learning hierarchical representations of non-decomposable functions; proposed as formal equivalent to ETI process.
  • Machine learning paradigm where agents learn to maximize cumulative reward through interaction.
  • Value learning method inferring reward function from expert demonstrations; reviewed as insufficient for superintelligent alignment
  • Method for fine-tuning LMs based on human preferences; mentioned as combining RL and LMs.
  • Proposed experimental paradigm to train morphogenesis using rewards and punishments, treating tissues as learning agents.
  • The hypothesis that cellular collectives can be trained via rewards/punishments to produce specific morphological outcomes.
  • Variant of RLHF where human feedback is replaced with AI-generated feedback for harmlessness.