concept
active
concept:lowe-et-al-2017-multi-agent-actor-critic-for-mixed-cooperative-competitive-environmentsLowe et al. 2017 - Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Source paper for the MADDPG algorithm used in RL baseline experiments
Neighborhood — ranked by edge-count
Papers (1)
paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Dismissal of earlier criteria as too narrow.
- Motivational statement for the benchmark design philosophy.
- Connects collective intelligence to evolutionary potential.
- Approach using multiple LLM agents for generation and critique, a key prior approach to improving reflection.
- Probabilistic behaviour of an ensemble used to derive the free-energy principle.
- Paper on LLM-based simulacra of human behaviour; cited as ref 3
- Glaese et al. 2022: Improving alignment of dialogue agents via targeted human judgementsconcept0.727Alignment paper cited as example of RLHF fine-tuning technique; ref 19