finding
active
finding:euryale-70b-roleplay-lora-on-llama-3-3-70b-scores-1-81-below-its-base-model-llama-3-3-70b-at-1-91

Euryale 70B (roleplay LoRA on Llama 3.3 70B) scores 1.81, below its base model Llama 3.3 70B at 1.91

Demonstrates that roleplay fine-tuning actively suppresses self-observation rather than merely having no effect

Source paper

extracted_from
Koan Battery: Measuring Reflective Mode Accessibility in AI
(2026) · Borzov, Anton
