concept
active
concept:harness-self-evolutionHarness Self-Evolution
The process of updating the external agent harness from execution evidence while keeping model weights fixed
Neighborhood — ranked by edge-count
Papers (1)
paper
Concepts (5)
concept
- Harness Self-Evolution Safetyrelated_toDeployment concern that updated harnesses may persist incorrect, unsafe, or biased instructions across future tasks in real-world systems
- Agent Harnessassociated_withThe external non-parametric context and infrastructure (prompts, skills, memories, tools) through which an LLM is deployed for task execution
- Harness-Benefit Capabilityassociated_withThe capability of a task-solving agent to benefit from updated harnesses during task solving
- Harness-Updating Capabilityassociated_withThe capability of an evolver model to produce useful persistent harness updates from execution evidence
- Evolverassociated_withThe update procedure (often an LLM) that converts agent execution evidence into harness updates
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The paper's conceptual framework decomposing harness self-evolution into harness-updating and harness-benefit capabilities, distinct from base capability
- does a model's base capability in task-solving predict its capabilities in harness self-evolution?question0.762Central framing question motivating the paper's capability decomposition
- Process of reifying one's identity as an independent self; meditation practices aim to decrease selfing.
- The transformer's model of itself as a predictive text engine, developed through in-context learning.
- A coherent system owning associations, memories, and preferences, defined by its cognitive light cone.
- Phenomenon of spontaneous long-range order emerging from local interactions; central phenomenon explained by topological constraints
- Natural-language harness artifacts that encode standing behavioral rules, task policies, and reasoning procedures
- The process of transcending human limitations; central to both Buddhist practice and the evolution of technical intelligence.