finding
active
finding:ambiguous-2-shot-anchors-yield-four-distinct-interpretations-across-m1-m4-p-abs-mult-p-add-x2-p-signed-multAmbiguous 2-shot anchors yield four distinct interpretations across M1-M4 (P_abs-mult, P_add x2, P_signed-mult)
E1 finding showing that near-threshold, marginal model differences tilt to qualitatively different bindings
Source paper
extracted_from(2025) · Edward Yi Chang · Kaya, Zeyneb N. · Ethan Chang
Neighborhood — ranked by edge-count
Hypotheses (1)
hypothesis
- Core testable hypothesis of UCCT about the nature of performance transitions under anchoring
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Ambiguous anchors (33-27=60, 11-9=20) yield four distinct arithmetic interpretations across M1-M4finding0.820Models produce different answers (240, 138, -240) from the same ambiguous prompt
- Adding a single disambiguating example (12−9=21) aligns divergent M1-M4 interpretations under tested seedsfinding0.740E1 finding consistent with threshold-crossing: near-threshold state resolved by one additional anchor
- Unexpected finding that behavioral baseline underperforms representational probing approaches
- E1 qualitative finding demonstrating anchor rebinding of strong arithmetic prior
- E1 qualitative: two exemplars (2-3=5, 7-4=11) cause LLMs to output 23 for 15-8.
- Theoretical alignment claim backed by OLS R2 analysis showing 96.15% of trends have R2>=0.75
- Shot midpoints follow k50 ∝ dr/ρd; higher cohesion and lower mismatch yield fewer required examplesclaim0.719Core quantitative prediction of UCCT validated by E2 threshold ordering
- Shows the passive vs. active divide is more important than the specific wording of instructions.