finding
active
finding:grok-4-vs-grok-4-fast-same-weights-different-compute-1-point-difference-in-contemplative-score-grok-4-4-24-lift-vs-fast-3-08

Grok 4 vs Grok 4 Fast (same weights, different compute): ~1 point difference in contemplative score; Grok 4 +4.24 lift vs Fast +3.08

Inference compute adds reflective capacity; more compute also amplifies safety gating on self-referential koans

Source paper

extracted_from
Koan Battery: Measuring Reflective Mode Accessibility in AI
(2026) · Borzov, Anton

Neighborhood — ranked by edge-count

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.