method
active
method:multi-agent-deep-deterministic-policy-gradient-maddpgMulti-Agent Deep Deterministic Policy Gradient (MADDPG)
RL algorithm used to train baseline agents in the physical deception environment
Neighborhood — ranked by edge-count
Papers (1)
paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Gradient balancing by solving multi-objective optimization for minimum-norm aggregated gradient.
- A variational approach for dynamic Bayesian inversion of nonlinear causal models, named in this paper.
- Baseline method against which probe-based ranking is compared; more computationally expensive.
- Training-free technique normalizing all task gradients to the maximum gradient norm magnitude
- Generative model substrate for active inference; discrete states, actions, outcomes, and temporal policies.
- Used for updating hidden state expectations; provides dynamical process theory testable against neuronal data
- Intervention method adding the difference in mean activations between two conditions to a representation
- Modeling framework for discrete state-space decision-making under uncertainty, used as generative model in active inference.