method
active
method:negative-control-dense-off-task-anchors
Negative Control: Dense Off-Task Anchors
E3 robustness test: dense but off-task anchors yield high ρ_d AND high d_r, confirming that mismatch dominates S
Neighborhood — ranked by edge-count
Methods (1)
method
- Quantitative study correlating layer-wise anchoring geometry (S_max, AUS_N) with behavioral thresholds θ50
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- E3 negative control validating that both ρ_d AND d_r must be favorable for S to exceed S_c
- Negative control ('precise analytical assistant') suppresses scores: Haiku -0.64, GPT-5.4 -1.06 · finding · 0.738
  Confirms specificity of the contemplative prompt; analytical framing increases task focus at the expense of self-observation
- Conclusion from E1 and central UCCT claim.
- Biological analogue to ESR where top-down mechanisms detect distracting inputs and redirect processing
- Adaptation of Hewitt and Liang control tasks to CausalGym: next-token labels replaced with arbitrary tokens to measure method expressivity
- Suppressing the feature makes the model ignore bugs.
- Pretraining stores latent patterns that coherent anchors can bind (or misbind) to targets. · quote · 0.702
  Load-bearing quote capturing the core metaphor