finding
active
finding:claude-models-score-4-91-higher-than-llama-on-baseline-constitutional-ai-vs-open-source-gap
Claude models score +4.91 higher than Llama on baseline (Constitutional AI vs open-source gap)
Claude models score well above open-source models on baseline; the Constitutional AI fingerprint is visible across the family
Source paper
extracted_from · Borzov, Anton (2026)
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Constitutional AI fingerprint in dimension profile; training that makes models self-observant also makes them polished at cost to aliveness
- Interpretive claim connecting the battery's circularity to the empirical finding
- Replication across open-weight models supports scale-emergence finding
- Constitutional AI models show mean contemplative lift of only +0.81, while SFT models lift +3.18 (finding, cosine 0.752): Constitutional AI training provides internally what the contemplative prompt provides externally
- Case study demonstrating mechanism behind flat harness-updating: smaller models reach same procedural content
- Establishes generalizability of the core difficulty-boundary finding across model families.
- Key finding about the relationship between capability and introspection.
- Empirical evidence that naive one-stage CoT fails in language-only setting; two-stage + vision achieves state-of-the-art.