concept
active
concept:raileanu-et-al-2018-modeling-others-using-oneself-in-multi-agent-reinforcement-learningRaileanu et al. 2018 - Modeling Others Using Oneself in Multi-Agent Reinforcement Learning
Reference for Self-Other Modeling (SOM) framework, a related but less scalable approach to SOO
Neighborhood — ranked by edge-count
Papers (1)
paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Argument that RL meets the agency indicator.
- Key insight linking individual rewards to system-level learning.
- Alternative framework for agent behavior; based on reward maximization rather than free energy minimization.
- Reinforcement learning methods that update parameters at the end of an episode based on sampled returns.
- Method for fine-tuning LMs based on human preferences; mentioned as combining RL and LMs.
- RL variant that maintains beliefs over environment model; compared to active inference using Thompson sampling.
- RLHF paper cited as a major fine-tuning technique used in commercial dialogue agents
- §3 Discussion.