finding
active
finding:grok-4-vs-grok-4-fast-same-weights-different-compute-1-point-difference-in-contemplative-score-grok-4-4-24-lift-vs-fast-3-08
Grok 4 vs Grok 4 Fast (same weights, different compute): ~1 point difference in contemplative score; Grok 4 +4.24 lift vs Fast +3.08
Inference compute adds reflective capacity; more compute also amplifies safety gating on self-referential koans
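As a quick arithmetic check on the figures in the title, the gap between the two reported lifts can be computed directly. This is only an illustrative sketch: the variable names are hypothetical, and it assumes the two lift values were measured on the same 10-point scale and are directly comparable.

```python
# Lift values as reported in the finding title.
grok4_lift = 4.24       # Grok 4
grok4_fast_lift = 3.08  # Grok 4 Fast

# Gap between the two lifts, rounded to avoid floating-point noise.
lift_gap = round(grok4_lift - grok4_fast_lift, 2)
print(f"lift gap: {lift_gap:+.2f}")  # prints "lift gap: +1.16"
```

A gap of 1.16 points between the lifts is consistent with the roughly 1-point difference stated in the title.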
Source paper
extracted_from · Borzov, Anton (2026)
Neighborhood — ranked by edge-count
Claims (1)
claim
- Interpretation of Grok 4 vs Grok 4 Fast per-koan comparison
Hypotheses (1)
hypothesis
- Exploratory hypothesis supported by Grok 4 vs Fast ~1pt difference
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Highest contemplative lift among all 28 models; Grok 4 is the clearest high-gated model example
- Contemplative framing reframes self-referential probes as contemplative exercises, disarming safety classifier
- Battery does not detect epistemic humility alone; contemplative prompt does something distinct
- Validates robustness of universal lift finding
- Provides discriminant evidence: if battery rewarded verbosity, prompted responses should be longer
- Second-highest lift; Gemini Pro is the highest-gated model in the study
- A 337-character contemplative system prompt lifts all 28 models by +2.62 points on a 10-point scale. · finding · 0.718 · Core empirical result: every model, every architecture, every alignment type responds to the contemplative prompt with measurable gain.
- Constitutional AI models show mean contemplative lift of only +0.81, while SFT models lift +3.18 · finding · 0.710 · Constitutional AI training provides internally what the contemplative prompt provides externally