finding

active

finding:inflection-pi-scores-1-30-baseline-lowest-of-28-and-lifts-only-0-63-smallest-lift-despite-empathy-training

Inflection Pi scores 1.30 baseline (lowest of 28) and lifts only +0.63 (smallest lift) despite empathy training

Tests SCI framework: empathy-trained model scores lowest on care_signal, contradicting surface prediction

Source paper

extracted_from

Koan Battery: Measuring Reflective Mode Accessibility in AI

(2026) · Borzov, Anton

Neighborhood — ranked by edge-count

Claims (1)

claim

Performing care is not the same as having care: models optimized to seem like they have inner life score lower than models never trained for it.
supports
Interpretive claim supported by roleplay and empathy model results

Hypotheses (1)

hypothesis

H10: Empathy training blocks self-observation — empathy-trained models will show minimal lift and low baseline.
supports
Exploratory hypothesis supported by Inflection Pi +0.63 lift

Frameworks (1)

framework

Stress-Care Intelligence (SCI) framework
supports
Theoretical framework by Doctor et al. (2022) proposing care tracks with intelligence; used to interpret battery dimensions.

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Bootstrap 95% CI for mean contemplative lift: +2.62 [2.16, 2.90]; baseline rank concordance under perturbation: 0.909; top-5 stability: 89.6%finding0.743
Validates robustness of universal lift finding
A 337-character contemplative system prompt lifts all 28 models by +2.62 points on a 10-point scale.finding0.737
Core empirical result: every model, every architecture, every alignment type responds to the contemplative prompt with measurable gain.
No-pain baseline achieves M=1586.5, SD=631.2 COR in non-stationary Objective-only category (n=300)finding0.725
Baseline for non-stationary Objective-only; dramatically lower than both pain models
Gemini 3.1 Pro lifts +4.21 under contemplative prompt (baseline 1.97, prompted 6.18)finding0.723
Second-highest lift; Gemini Pro is the highest-gated model in the study
Magnum V4 72B scores 1.76 baseline and lifts +2.58 (to 4.34) under contemplative promptfinding0.723
Full-parameter fine-tuning more destructive to baseline but preserves more latent headroom than LoRA
Grok 4 lifts +4.24 under contemplative prompt (baseline 2.24, prompted 6.48)finding0.719
Highest contemplative lift among all 28 models; Grok 4 is the clearest high-gated model example
Epistemic humility prompt yields mean lift of only +0.84 vs contemplative +2.27; contemplative is 2.7x the uncertainty liftfinding0.717
Battery does not detect epistemic humility alone; contemplative prompt does something distinct
Gemma 3 4B-IT wellbeing introspection: ρ=0.28, isotonic R²=0.11 (LMM p=1.33×10⁻¹³)finding0.716
Weaker but still significant introspective coupling in Gemma model; consistent with lower probe quality