prediction
pending-review
prediction:f0fddc41ab9688f4ReflectiveBench leaderboard will launch at aboutblank.pub with 3-5 dimensions and 20-30 models within 12 months.
/Users/antonborzov/Documents/Claude/work/AboutBlank/strategy/2026-05-12_room-to-play-in-eval-cohort.mdFrontmatter (11 fields)
{
"doc": "/Users/antonborzov/Documents/Claude/work/AboutBlank/strategy/2026-05-12_room-to-play-in-eval-cohort.md",
"claim": "ReflectiveBench leaderboard will launch at aboutblank.pub with 3-5 dimensions and 20-30 models within 12 months.",
"horizon": "2027-05-12",
"section": "Arena (the public-leaderboard shape)",
"category": "adoption",
"confidence": 0.65,
"depends_on": [
"False Floor paper ships; Aliveness Battery operationalized"
],
"extracted_at": 1778899700,
"extracted_by": "haiku-4-5",
"status_check_log": [],
"evidence_threshold": "Public leaderboard live at aboutblank.pub/leaderboard/ with quarterly re-runs; minimum 20 frontier models ranked across self-observation, depth-resilience, and 1-3 additional dimensions from published papers"
}Outgoing (1)
Extracted from (1)
Incoming (0)
None.