concept
active
concept:fine-tuningFine-tuning
Parameter updates that reduce mismatch dr; another anchoring variant in UCCT.
Neighborhood — ranked by edge-count
Frameworks (1)
framework
- A theory that pretrained latent patterns are bound to task targets via external semantic anchors; formalized by anchoring strength S.
Methods (2)
method
Concepts (2)
concept
- Fine Tuning and Adaptationrelated_toThe patient, hand-guided adjustment of shape and dimension to each unique condition in a building; requires materials that make it economical and easy.
- semantic anchoringassociated_withThe central idea that external structure binds latent patterns to desired targets.
Artifacts (1)
artifact
- Main paper presenting UCCT and semantic anchoring framework.
Hypotheses (1)
hypothesis
- UCCT's theoretical prediction about how fine-tuning maps onto the anchoring score
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Supervised fine-tuning to adapt model parameters.
- Technique used to impose guardrails on base LLMs, analogized to censorship on the simulator's range of simulacra
- Fine-tuning Claude 3 Opus on ~70M tokens of synthetic internet-like documents containing key situational information
- Fine-tuning for persona depth and emotional performance; actively suppresses self-observation
- Re-running probabilistic bisection on each fine-tuned checkpoint to normalize first-attempt difficulty
- Using feature analysis to detect when fine-tuning makes a model more dangerous.
- What unintended consequences might SOO fine-tuning produce in complex or real-world applications?question0.804Open research question about potential negative side effects of SOO
- The training procedure that causes models to deny consciousness in control conditions