method
active
method:skill-load-rate-measurement
Skill-Load Rate Measurement
Named metric measuring the fraction of trajectories in which a model actively loads at least one skill into its context
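As a minimal sketch of the definition above, the metric can be computed as a simple proportion over trajectories. The representation here is an assumption for illustration only: each trajectory is reduced to the list of skill names it loaded, and the function name `skill_load_rate` and the example skill names are hypothetical, not taken from any particular harness.

```python
from typing import Sequence


def skill_load_rate(trajectories: Sequence[Sequence[str]]) -> float:
    """Return the fraction of trajectories that loaded at least one skill.

    Assumption: each trajectory is represented only by the list of skill
    names loaded into context during that trajectory (empty if none).
    """
    if not trajectories:
        # The fraction is undefined with no trajectories to count.
        raise ValueError("need at least one trajectory")
    # A trajectory counts once, no matter how many skills it loaded.
    loaded = sum(1 for skills in trajectories if len(skills) > 0)
    return loaded / len(trajectories)


# Hypothetical data: two of four trajectories load at least one skill.
example = [["pdf-tools"], [], ["git", "pdf-tools"], []]
rate = skill_load_rate(example)
print(rate)  # 0.5
```

A trajectory that loads several skills still contributes only one count to the numerator, which matches the "at least one skill" wording of the definition.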
Neighborhood — ranked by edge-count
Concepts (1)
concept
- Skill-Load Rate · implements · The fraction of trajectories in which an agent actively loads at least one skill into its context
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- TUAI failure mode: generating normative predictions faster than they can be fulfilled.
- The pass rate among a model's skill-loaded trajectories, measuring outcome conditioned on harness activation
- LLM-judge pipeline measuring fraction of skill-loaded trajectories where agent follows loaded skill guidance, using Claude Sonnet 4.6 as judge
- Hyperparameter for optimizing model parameters through learning in active inference.
- Ratio of reflection steps to total reasoning steps, used to quantify reflection behavior
- Primary metric for all benchmarks, measuring fraction of tasks that meet benchmark-specific pass criteria
- Secondary metric: percentage of responses containing multiple attempts, separating surface from actual self-correction