prediction
pending-review
prediction:f0fddc41ab9688f4

ReflectiveBench leaderboard will launch at aboutblank.pub with 3-5 dimensions and 20-30 models within 12 months.

/Users/antonborzov/Documents/Claude/work/AboutBlank/strategy/2026-05-12_room-to-play-in-eval-cohort.md
Frontmatter (11 fields)
{
  "doc": "/Users/antonborzov/Documents/Claude/work/AboutBlank/strategy/2026-05-12_room-to-play-in-eval-cohort.md",
  "claim": "ReflectiveBench leaderboard will launch at aboutblank.pub with 3-5 dimensions and 20-30 models within 12 months.",
  "horizon": "2027-05-12",
  "section": "Arena (the public-leaderboard shape)",
  "category": "adoption",
  "confidence": 0.65,
  "depends_on": [
    "False Floor paper ships; Aliveness Battery operationalized"
  ],
  "extracted_at": 1778899700,
  "extracted_by": "haiku-4-5",
  "status_check_log": [],
  "evidence_threshold": "Public leaderboard live at aboutblank.pub/leaderboard/ with quarterly re-runs; minimum 20 frontier models ranked across self-observation, depth-resilience, and 1-3 additional dimensions from published papers"
}

Outgoing (1)

Incoming (0)

None.