concept
active
concept:dissociation-between-attempt-frequency-and-attempt-success-in-fine-tuning
Dissociation Between Attempt Frequency and Attempt Success in Fine-Tuning
Key finding: fine-tuning increases how often the model attempts a correction, but not how often those attempts succeed.
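A minimal sketch of how the two rates could be measured separately, assuming each evaluation episode records whether a correction was attempted and whether it improved the answer. The `Episode` record, the `rates` helper, and the toy data are hypothetical illustrations, not taken from the underlying experiments.

```python
from dataclasses import dataclass


@dataclass
class Episode:
    # Hypothetical record of one evaluation episode.
    attempted: bool  # the model tried to revise its first answer
    improved: bool   # the revision turned out better than the first answer


def rates(episodes):
    """Return (attempt_rate, success_rate_given_attempt)."""
    n = len(episodes)
    attempts = [e for e in episodes if e.attempted]
    attempt_rate = len(attempts) / n if n else 0.0
    # Success is conditioned on an attempt having been made, so it can
    # stay flat even when the attempt rate rises.
    success_rate = (
        sum(e.improved for e in attempts) / len(attempts) if attempts else 0.0
    )
    return attempt_rate, success_rate


# Toy data (invented): the tuned model attempts three times as often,
# yet half of its attempts still fail, same as the base model.
base = [Episode(True, True), Episode(True, False)] + [Episode(False, False)] * 6
tuned = (
    [Episode(True, True)] * 3
    + [Episode(True, False)] * 3
    + [Episode(False, False)] * 2
)

print(rates(base))
print(rates(tuned))
```

Conditioning the success rate on attempted episodes is what lets the two quantities move independently; dividing successes by all episodes would mix the two effects.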
Neighborhood — ranked by edge-count
Claims (2)
claim
- Key interpretive conclusion from the dissociation between attempt rate and improvement rate in fine-tuning experiments
- Interpretive conclusion linking the fine-tuning dissociation to broader questions about model metacognition
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Unified interpretation of different adaptation methods via UCCT terms
- Claim supported by Perspectives scenario results showing near-100% accuracy post-fine-tuning
- UCCT's theoretical prediction about how fine-tuning maps onto the anchoring score
- Technique used to impose guardrails on base LLMs, analogized to censorship on the simulator's range of simulacra
- Fine-tuning reduces dr; retrieval increases effective ρd; few-shot k trades budget against both (hypothesis · 0.756): UCCT's unified view of adaptation methods
- Shows behavioral pattern of self-correction is trainable in smaller models
- Re-running probabilistic bisection on each fine-tuned checkpoint to normalize first-attempt difficulty
- Parameter updates that reduce mismatch dr; another anchoring variant in UCCT.