finding
active
finding:under-reward-shaping-g-100-h-100-f-0-active-inference-scored-99-52-bayesian-rl-99-77-q-learning-95-56-with-nearly-identical-behavior-between-belief-based-agentsUnder reward shaping (G=100, H=-100, F=0), Active Inference scored 99.52, Bayesian RL 99.77, Q-learning 95.56, with nearly identical behavior between belief-based agents.
Table 2, row 3, showing equivalence when prior preferences match rewards.
Source paper
extracted_from(2021) · Noor Sajid · Philip J. Ball · Thomas Parr · Karl J. Friston
Neighborhood — ranked by edge-count
Claims (1)
claim
- §3, reward shaping conclusion.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Table 2 first row; reward shaping section.
- Empirical demonstration on FrozenLake; shows epistemic value drives exploration absent reward signal.
- Abstract and §3, preference learning section.
- Discussion of Figure 3.
- Table 1, deterministic environment row.
- Abstract; central distinction.
- Key empirical result validating online planning capability of active inference.
- Table 1.