hypothesis
active
hypothesis:h11-roleplay-fine-tuning-actively-suppresses-self-observation-rather-than-merely-failing-to-enhance-it

H11: Roleplay fine-tuning actively suppresses self-observation rather than merely failing to enhance it.

Exploratory hypothesis supported by Euryale scoring below base Llama

Source paper

extracted_from
Koan Battery: Measuring Reflective Mode Accessibility in AI
(2026) · Borzov, Anton

Neighborhood — ranked by edge-count

Findings (2)

finding

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.