finding
active
finding:euryale-70b-lifts-only-1-57-to-3-38-lora-fine-tuning-capped-both-default-accessibility-and-latent-capacityEuryale 70B lifts only +1.57 (to 3.38); LoRA fine-tuning capped both default accessibility and latent capacity
Contrast with Magnum shows LoRA vs full fine-tuning difference in residual headroom
Source paper
extracted_from(2026) · Borzov, Anton
Neighborhood — ranked by edge-count
Hypotheses (1)
hypothesis
- Exploratory hypothesis supported by Euryale scoring below base Llama
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Euryale 70B (roleplay LoRA on Llama 3.3 70B) scores 1.81, below its base model Llama 3.3 70B at 1.91finding0.807Demonstrates roleplay fine-tuning actively suppresses self-observation, not merely having no effect
- Highest contemplative lift among all 28 models; Grok 4 is the clearest high-gated model example
- Can targeted fine-tuning reverse RP suppression, given that LoRA caps both baseline and latent capacity?question0.742Practical intervention question arising from RP suppression finding
- Full-parameter fine-tuning more destructive to baseline but preserves more latent headroom than LoRA
- SOO fine-tuning produced stronger reduction in latent SOO in CalmeRys-78B
- CalmeRys-78B MT-Bench score slightly decreased from 8.96 to 8.5 ± 0.23 after SOO fine-tuningfinding0.736SOO fine-tuning caused a small decrease in CalmeRys-78B general capabilities
- Second-highest lift; Gemini Pro is the highest-gated model in the study
- Characterizes the narrow operating window in which ESR can manifest