community
active
leiden_hybrid_concepts
label: haiku
community:leiden_hybrid_concepts-run4-c11-c2
Prompt anchoring and latent structure binding
How minimal, task-specific prompt examples rebind model priors across threshold boundaries without weight updates, studied through arithmetic reasoning tasks.
8 members.
Drawn from 1 source
The papers/notes whose extracted claims & findings make up this cluster.
Bridges (6)
Other communities that share members with this one — cross-cutting threads or papers that sit at the seam between two themes.
Claims (5)
- Anchors recruit and bind latent structure; they do not create new knowledge in the model.
  Scope-limiting claim clarifying UCCT's interpretation of what anchoring does.
- Small prompt changes can yield threshold-like shifts because S crosses the critical value Sc.
  Authors' explanation for abrupt behavioral changes.
- Small, coherent anchors can rebind strong priors and exhibit near-threshold sensitivity.
  Conclusion from E1 and central UCCT claim.
- Small, coherent anchors can rebind strong priors without changing model weights.
  Cross-domain anchoring claim.
- Threshold-like performance flips occur when anchoring strength S crosses a task-dependent critical value Sc.
  Interpretation of abrupt behavior changes.
Findings (3)
- 2-shot reinterpretation of '-' yields 23 for 15-8 on held-out query.
  E1 qualitative: two exemplars (2-3=5, 7-4=11) cause LLMs to output 23 for 15-8.
- Adding a single disambiguating example (12-9=21) aligns divergent M1-M4 interpretations under tested seeds.
  E1 finding consistent with threshold-crossing: near-threshold state resolved by one additional anchor.
- Ambiguous anchors (33-27=60, 11-9=20) yield four distinct arithmetic interpretations across M1-M4.
  Models produce different answers (240, 138, -240) from the same ambiguous prompt.
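
The arithmetic behind these findings can be checked with a small sketch. It does not model how an LLM processes anchors. It only tests which readings of '-' are consistent with a given set of exemplars. The candidate readings in `CANDIDATE_READINGS` (including `ten_times_difference`) and the function name `consistent_readings` are illustrative assumptions, not taken from the source, and they are not claimed to be the interpretations M1-M4 actually produced.

```python
from typing import Callable, Dict, List, Tuple

# (left operand, right operand, result stated in the exemplar)
Exemplar = Tuple[int, int, int]

# Hypothetical candidate readings of the '-' symbol (an assumption for illustration).
CANDIDATE_READINGS: Dict[str, Callable[[int, int], int]] = {
    "subtract": lambda a, b: a - b,
    "add": lambda a, b: a + b,
    "multiply": lambda a, b: a * b,
    "ten_times_difference": lambda a, b: 10 * (a - b),
}


def consistent_readings(exemplars: List[Exemplar]) -> List[str]:
    """Return the names of candidate readings that reproduce every exemplar."""
    return [
        name
        for name, rule in CANDIDATE_READINGS.items()
        if all(rule(a, b) == result for a, b, result in exemplars)
    ]


if __name__ == "__main__":
    # Two coherent exemplars (2-3=5, 7-4=11): only addition fits.
    coherent = [(2, 3, 5), (7, 4, 11)]
    print(consistent_readings(coherent))         # ['add']
    print(CANDIDATE_READINGS["add"](15, 8))      # 23

    # Ambiguous exemplars (33-27=60, 11-9=20): more than one reading fits.
    ambiguous = [(33, 27, 60), (11, 9, 20)]
    print(consistent_readings(ambiguous))        # ['add', 'ten_times_difference']

    # One extra exemplar (12-9=21) removes the ambiguity.
    print(consistent_readings(ambiguous + [(12, 9, 21)]))  # ['add']
```

Under these assumed candidates, the two coherent exemplars leave a single reading, which gives 23 for 15-8. The ambiguous pair leaves at least two readings open, and the single additional exemplar 12-9=21 narrows them back to one.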