anchoring strength S

Composite score S = ρd − dr − log k predicting anchoring success.

Neighborhood — ranked by edge-count

framework

Unified Contextual Control Theory (UCCT)
implements
A theory that pretrained latent patterns are bound to task targets via external semantic anchors; formalized by anchoring strength S.
AIC/BIC Model Selection Criteria
analogous_to
Used as theoretical motivation for UCCT's log k budget term as complexity penalty mirroring model selection

claim

method

whitening and z-scoring procedure
implements
Calibration protocol: whiten embeddings on dev pool, z-score ρd and dr per layer.
per-dev z-scaling
associated_with
Standardizing ρd and dr using dev-set means and stds to form dimensionless components of S.
layer-wise anchoring score S(ℓ) computation
implements
Compute per-layer S(ℓ) = ρ̃d(ℓ) - d̃r(ℓ) - log k after whitening and standardization.

concept

Anchoring strength S = ρd - dr - log k
related_to
The calibrated score measuring how effectively anchors bind target patterns; a predictive correlate of success.
Logistic success surrogate
associated_with
Phenomenological fit P(success)=σ(αS+β) used to summarize sharpness and midpoints.
cohesion ρd
associated_with
Within-cluster tightness of target pattern representations.
Mismatch dr
associated_with
Distance between prior knowledge centroid and target pattern centroid, e.g., 1 - cos(eprior, eT).
representational mismatch dr
associated_with
Distance between prior and target representations.
Prior-Target Mismatch (dr)
associated_with
Measures how far the target PT is from the prior P_prior; increases anchoring difficulty
anchor budget k
associated_with
Number of few-shot exemplars provided.
Target Pattern Cohesion (ρd)
associated_with
Measures how tightly the target pattern PT clusters in representation space; one of three components of S

question

How much anchor budget is needed to flip behavior?
associated_with
Practical question addressed by S and k50.
When does behavior flip for a specific prompt and how much anchor budget is needed?
answered_by
The specific gap UCCT addresses that prior phase/representation work left open

artifact

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

anchorconcept0.773
External structure (in-context examples, retrieval, tuning) that biases latent pattern activation.
There is Strength in Numbersclaim0.759
Introspective strengthconcept0.746
Spearman ρ measuring rank-order agreement between logit-based self-report and probe score; the paper's primary monotonic association metric
Anchor Agent Setconcept0.744
Fixed set of representative task-solving agents (Opus 4.6, Sonnet 4.6, Qwen3-235B) used to compute harness-updating capability metrics
peak anchoring Sbmaxconcept0.738
Maximum layer-wise anchoring score across layers.
Conditioning strengthsconcept0.736
Parameters controlling the influence of conditioning signals in the generative process.
REINFORCEframework0.730
Classical RL algorithm adapted by the paper with modifications including clipped-surrogate losses and length-normalized advantages for agentic training.
Intervention Strength (Alpha)concept0.723
Scalar parameter modulating how strongly a steering vector shifts model activations; set to 15 for Exp1 and ±16 for Exp2