method
active
method:generalized-advantage-estimation-gae-returnGeneralized Advantage Estimation (GAE λ-return)
Used for computing policy gradient baselines during policy training
Neighborhood — ranked by edge-count
Papers (1)
paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Evolutionary search process used to evolve populations of embryos.
- EI and normalized EI could serve as a unified metric for out-of-distribution generalization.claim0.716Conjecture that maximizing EI yields causal representations invariant to distribution shifts.
- Minimizes the geometric mean loss.
- Active inference achieves Bayes-optimal arbitration between exploration and exploitation without handcrafted mechanisms like ε-greedy.
- Claim about broader applicability of the scaling argument
- Opening sentence defining self-evidencing.
- Mean-field theory model describing phase transitions in wealth distribution with economic growth.