finding
active
finding:grid-search-covers-312-130-subjective-reward-functions-per-environment-after-removing-duplicatesGrid search covers 312,130 subjective reward functions per environment after removing duplicates
Scale of the hyperparameter search establishing thoroughness of optimization
Source paper
extracted_from(2026) · Michael Petrowski · Milica Gašić
Neighborhood — ranked by edge-count
Methods (1)
method
- Hyperparameter Grid SearchsupportsExhaustive search over 312,130 subjective reward functions per environment to find best-performing agents
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The seven categories (Objective only, Expect only, Compare only, and four combinations) structuring the experiment
- Large-scale replication supporting the claim that subjective well-being maps onto prediction error structure
- Motivates active inference's solution: learning prior preferences from interaction rather than external specification.
- ATLAS hypothesis that a compact set of high-level functional tokens (Manip, Shape, Line, Arrow, Text) suffices for multi-domain visual reasoning.
- Framework from Singh, Lewis, and Barto 2009 used to select best-performing reward functions via grid search
- Seven categories determined by which components of f[h] are activated: Objective only, Expect only, Compare only, and combinations
- Table 2, row 3, showing equivalence when prior preferences match rewards.