finding

pending-review

finding:five-independent-llm-scorers-from-four-labs-produce-identical-rankings-spearman-0-8

Five independent LLM scorers from four labs produce identical rankings (Spearman ρ > 0.8).

battery.md

Frontmatter (9 fields)

{
  "doc": "battery.md",
  "context": "Scorer bias validation: Claude Haiku, Gemini Flash, GPT-5.4, Grok 4, Kimi K2.5 all converge on same model ordering.",
  "norm_label": "Five independent LLM scorers from four labs produce identical rankings (Spearman ρ > 0.8).",
  "graphify_id": "finding_blind_ranking",
  "source_file": "battery.md",
  "imported_from": "/tmp/koan-debug/battery/graph.json",
  "extracted_type": "finding",
  "source_location": "§5.2, Table 6",
  "graphify_file_type": "finding"
}

Outgoing (1)

Supports (1)

We do not claim to measure consciousness; the battery measures a reproducible, prompt-sensitive reflective mode.(claim)

Incoming (0)

None.

Mentions (1)

papers-typed
battery.md