finding
active
finding:q-learning-epsilon-1-decaying-to-0-achieved-average-score-80-44-78-96-81-93-in-deterministic-frozenlakeQ-learning (epsilon=1 decaying to 0) achieved average score 80.44 [78.96, 81.93] in deterministic FrozenLake.
Table 1.
Source paper
extracted_from(2021) · Noor Sajid · Philip J. Ball · Thomas Parr · Karl J. Friston
Neighborhood — ranked by edge-count
Papers (1)
paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Table 2 first row; reward shaping section.
- Bayesian model-based RL achieved average score 99.76 [99.45, 100.00] in deterministic FrozenLake.finding0.817Table 1.
- Table 1.
- Table 1, deterministic environment row.
- Discussion of Figure 3.
- Table 2 first row; reward shaping section.
- Table 2, row 3, showing equivalence when prior preferences match rewards.
- Key empirical result validating online planning capability of active inference.