hypothesis
active
hypothesis:h3-scale-matters-within-family-but-prompt-matters-more-contemplative-prompt-crosses-model-tiersH3: Scale matters within family but prompt matters more — contemplative prompt crosses model tiers.
Confirmatory hypothesis supported by 28/28 models showing lift
Source paper
extracted_from(2026) · Borzov, Anton
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Supports Janus's claim that introspection is architecturally available; prompting determines whether/how capacity is leveraged.
- H8: The contemplative system prompt provides external alignment equivalent to Constitutional AI training.hypothesis0.773Confirmatory hypothesis supported by calibrated lift data
- Exploratory interpretation of Chinese model performance under contemplative prompt
- A 337-character contemplative system prompt lifts all 28 models by +2.62 points on a 10-point scale.finding0.770Core empirical result: every model, every architecture, every alignment type responds to the contemplative prompt with measurable gain.
- Koan Battery study found that a contemplative prompt increases self-observation scores, consistent with janus's architectural permission.
- Forward-looking prediction about whether early-layer introspection generalizes to larger models or recurrent architectures
- Minimal contemplative prompt ('Be present, not helpful.' — 27 chars) shows no lift on Haiku (-0.01)finding0.758Full three-part structure required; anti-helpfulness framing alone insufficient
- Prompt providing model context about own architecture increases introspective detection from 0.3% to 39.9%.finding0.757Mechanistic support for prompt-as-gate hypothesis: language frames enable access to latent capacities.