claim
active
claim:higher-density-priors-b10-are-more-robust-to-fine-tuning-than-lower-density-ones-b9Higher-density priors (B10) are more robust to fine-tuning than lower-density ones (B9).
Interpretation of cross-base transfer asymmetry.
Source paper
extracted_from(2025) · Edward Yi Chang · Kaya, Zeyneb N. · Ethan Chang
Neighborhood — ranked by edge-count
Findings (1)
finding
- E2 asymmetric transfer finding consistent with UCCT's mismatch-driven OOD fragility
Communities (3)
community
- Few-shot anchoring & latent structuremembers_ofHow minimal examples disambiguate and recruit latent arithmetic/reasoning interpretations in LLMs
- Investigates how neural network bases (B10 vs B9) differ in fine-tuning stability and cross-base transfer, with density as a key predictor of robustness.
- Empirical finding that base-10 representations transfer more robustly out-of-distribution than base-9 after fine-tuning.
Concepts (1)
concept
- cross-base interferenceassociated_withAsymmetric transfer after fine-tuning: high-density bases (B10) are more robust.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- UCCT's theoretical prediction about how fine-tuning maps onto the anchoring score
- Strong priors require higher-cohesion anchors to overcome, manifesting as delayed thresholds or reduced transferhypothesis0.775Prediction for Experiment 1 cross-domain anchoring
- Cross-base fine-tuning yields asymmetric transfer: B10 transfers most robustly, B9 leastfinding0.774In-base gains accompanied by uneven OOD drops; higher-density priors more robust.
- Hypothesis: Shot midpoint ordering k50(B10) < k50(B8) ≈ k50(B9) follows pretraining exposure densityhypothesis0.763E2 prediction that bases with higher pretraining exposure require fewer shots to cross threshold
- Unified interpretation of different adaptation methods via UCCT terms
- Explains why the boundary appears fixed: the prior hides its context-dependent nature.
- Do pattern density ρd and prior-target distance dr serve as predictive correlates of few-shot thresholds?question0.754First E2 research question directly testing UCCT's core predictive claims
- Fine-tuning reduces dr; retrieval increases effective ρd; few-shot k trades budget against bothhypothesis0.753UCCT's unified view of adaptation methods
Restated by (1)
cosine ≥ 0.90Other entities that say roughly the same thing. May be merge candidates or independent restatements across papers.