finding
active
finding:dense-but-off-task-anchors-yield-high-d-and-high-dr-behavior-does-not-improve-consistent-with-mismatch-dominating-sDense but off-task anchors yield high ρd AND high dr; behavior does not improve, consistent with mismatch dominating S
E3 negative control validating that both ρd AND dr must be favorable for S to exceed Sc
Source paper
extracted_from(2025) · Edward Yi Chang · Kaya, Zeyneb N. · Ethan Chang
Neighborhood — ranked by edge-count
Claims (1)
claim
- Shot midpoints follow k50 ∝ dr/ρd; higher cohesion and lower mismatch yield fewer required examplessupportsCore quantitative prediction of UCCT validated by E2 threshold ordering
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- E3 robustness test: dense but off-task anchors yield high ρd AND high dr, confirming mismatch dominates S
- Strong priors require higher-cohesion anchors to overcome, manifesting as delayed thresholds or reduced transferhypothesis0.752Prediction for Experiment 1 cross-domain anchoring
- (ii) does the anchoring score S = ρd − dr − log k consistently correlate with performance across anchoring methods?question0.752Second research question in E2
- S = ρd − dr − log k is a predictive correlate of anchoring success across few-shot, SFT, and CoT.claim0.747UCCT's practical utility claim.
- Establishes task difficulty as a hard limit that instructions cannot overcome.
- Interpretation of abrupt behavior changes.
- Conclusion from E1 and central UCCT claim.
- Confirms that post-evolution performance bottleneck is on the agent side, not evolver side