finding
active
finding:q-learning-epsilon-1-decaying-to-0-achieved-average-score-80-44-78-96-81-93-in-deterministic-frozenlake

Q-learning (epsilon=1 decaying to 0) achieved average score 80.44 [78.96, 81.93] in deterministic FrozenLake.

Table 1.

Source paper

extracted_from
Active inference: demystified and compared
(2021) · Noor Sajid · Philip J. Ball · Thomas Parr · Karl J. Friston

Neighborhood — ranked by edge-count

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.