method
active
method:fine-tuning-sftfine-tuning (SFT)
Supervised fine-tuning to adapt model parameters.
Neighborhood — ranked by edge-count
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Parameter updates that reduce mismatch dr; another anchoring variant in UCCT.
- The patient, hand-guided adjustment of shape and dimension to each unique condition in a building; requires materials that make it economical and easy.
- Fine-tuning Claude 3 Opus on ~70M tokens of synthetic internet-like documents containing key situational information
- Re-running probabilistic bisection on each fine-tuned checkpoint to normalize first-attempt difficulty
- Fine-tuning on Claude-generated self-correction examples with loss masking to induce ESR-like behavior
- Technique used to impose guardrails on base LLMs, analogized to censorship on the simulator's range of simulacra
- The central framework proposed in this paper: aligning AI internal representations of self and others to reduce deceptive behavior
- Unified interpretation of different adaptation methods via UCCT terms