concept
active
concept:anchoring-strength-s

anchoring strength S

Composite score S = ρ_d − d_r − log k, predicting anchoring success.

Neighborhood — ranked by edge-count

Frameworks (2)

framework

Methods (3)

method

Concepts (8)

concept
  • The calibrated score measuring how effectively anchors bind target patterns; a predictive correlate of success.
  • Phenomenological fit P(success) = σ(αS + β), used to summarize sharpness and midpoints.
  • Cohesion ρ_d
    associated_with
    Within-cluster tightness of target pattern representations.
  • Mismatch d_r
    associated_with
    Distance between prior knowledge centroid and target pattern centroid, e.g., 1 − cos(e_prior, e_T).
  • Distance between prior and target representations.
  • Measures how far the target P_T is from the prior P_prior; increases anchoring difficulty.
  • Anchor budget k
    associated_with
    Number of few-shot exemplars provided.
  • Measures how tightly the target pattern P_T clusters in representation space; one of three components of S.
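
The components listed above can be combined in a few lines. The following is a minimal sketch, not the source's implementation. It assumes cohesion ρ_d is supplied as a precomputed number (the card does not give its formula), that the logarithm in S is the natural log, and that α and β are fitted elsewhere. The function names and all numeric values in the usage note are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def mismatch(e_prior, e_target):
    """Mismatch d_r = 1 - cos(e_prior, e_T), the example form given in the card."""
    return 1.0 - cosine(e_prior, e_target)


def anchoring_strength(rho_d, d_r, k):
    """Composite score S = rho_d - d_r - log k.

    Assumption: natural logarithm; the card does not state the base.
    """
    if k < 1:
        raise ValueError("anchor budget k must be at least 1")
    return rho_d - d_r - math.log(k)


def p_success(s, alpha, beta):
    """Phenomenological fit P(success) = sigma(alpha * S + beta)."""
    return 1.0 / (1.0 + math.exp(-(alpha * s + beta)))
```

For example, with illustrative values ρ_d = 0.8, centroids `[1.0, 0.0]` and `[0.0, 1.0]` (orthogonal, so d_r = 1), and k = 4, one would call `anchoring_strength(0.8, mismatch([1.0, 0.0], [0.0, 1.0]), 4)` and pass the result to `p_success` with whatever α and β a fit produced.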

Questions (2)

question

Artifacts (1)

artifact

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • anchor · concept · 0.773
    External structure (in-context examples, retrieval, tuning) that biases latent pattern activation.
  • Spearman ρ measuring rank-order agreement between logit-based self-report and probe score; the paper's primary monotonic association metric.
  • Anchor Agent Set · concept · 0.744
    Fixed set of representative task-solving agents (Opus 4.6, Sonnet 4.6, Qwen3-235B) used to compute harness-updating capability metrics.
  • Maximum layer-wise anchoring score across layers.
  • Parameters controlling the influence of conditioning signals in the generative process.
  • REINFORCE · framework · 0.730
    Classical RL algorithm adapted by the paper with modifications including clipped-surrogate losses and length-normalized advantages for agentic training.
  • Scalar parameter modulating how strongly a steering vector shifts model activations; set to 15 for Exp1 and ±16 for Exp2.