claim
active
claim:s-is-a-predictive-correlate-calibrated-on-dev-sets-not-an-absolute-measureS is a predictive correlate calibrated on dev sets, not an absolute measure
Clarifies nature of S.
Source paper
extracted_from(2025) · Edward Yi Chang · Kaya, Zeyneb N. · Ethan Chang
Neighborhood — ranked by edge-count
Findings (1)
finding
- Shows S predicts anchoring effectiveness.
Communities (2)
community
- Few-shot anchoring & latent structuremembers_ofHow minimal examples disambiguate and recruit latent arithmetic/reasoning interpretations in LLMs
- Predictive metric S = ρd - dr - log k quantifies when LLM behavior sharply transitions across few-shot, SFT, and CoT settings via layer-wise calibration.
Artifacts (1)
artifact
- Main paper presenting UCCT and semantic anchoring framework.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- A central claim about the operational value of S.
- S = ρd − dr − log k is a predictive correlate of anchoring success across few-shot, SFT, and CoT.claim0.763UCCT's practical utility claim.
- Fixed dev pool of 1000 prompts used for whitening and z-scoring parameters.
- Claim that S predicts threshold midpoints across different bases, tasks, and models
- Predictive practical utility claim.
- Claim about broader applicability of the scaling argument
- Kruskal-Wallis test result: Constitutional AI predicts highest baseline; roleplay/empathy training predict lowest.
- Illustrates how non-separable functions shift identity to the collective level.