Harness-Following Rate Measurement

LLM-judge pipeline measuring fraction of skill-loaded trajectories where agent follows loaded skill guidance, using Claude Sonnet 4.6 as judge

Neighborhood — ranked by edge-count

paper

concept

Harness-Following Rate
implements
The fraction of skill-loaded trajectories judged by an LLM judge as following the loaded skill's guidance

method

Locked Rubric Extraction
uses
First stage of HFR pipeline that converts skill body into a strict JSON rubric of atomic procedural instructions for adherence judging

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Harness Adherence Failureconcept0.734
A failure mode where even when harness artifacts are loaded, weak-tier models fail to follow their guidance faithfully
Skill-Load Rate Measurementmethod0.725
Named metric measuring the fraction of trajectories in which a model actively loads at least one skill into its context
Agent Harnessconcept0.724
The external non-parametric context and infrastructure (prompts, skills, memories, tools) through which an LLM is deployed for task execution
Prompts (Harness Artifact)concept0.718
Natural-language harness artifacts that encode standing behavioral rules, task policies, and reasoning procedures
ESR Rate (metric)concept0.717
Primary metric: percentage of responses containing multiple attempts that successfully improve on the first attempt
Harness-Benefit Capabilityconcept0.712
The capability of a task-solving agent to benefit from updated harnesses during task solving
Harness-Updating Capabilityconcept0.702
The capability of an evolver model to produce useful persistent harness updates from execution evidence
Harness-Updating Gain (Δupdate)method0.700
Metric measuring harness-updating capability as the mean pairwise gain across an anchor agent set