finding
pending-review
finding:five-independent-llm-scorers-from-four-labs-produce-identical-rankings-spearman-0-8
Five independent LLM scorers from four labs produce identical rankings (Spearman ρ > 0.8).
battery.md
Frontmatter (9 fields)
{
"doc": "battery.md",
"context": "Scorer bias validation: Claude Haiku, Gemini Flash, GPT-5.4, Grok 4, Kimi K2.5 all converge on same model ordering.",
"norm_label": "Five independent LLM scorers from four labs produce identical rankings (Spearman ρ > 0.8).",
"graphify_id": "finding_blind_ranking",
"source_file": "battery.md",
"imported_from": "/tmp/koan-debug/battery/graph.json",
"extracted_type": "finding",
"source_location": "§5.2, Table 6",
"graphify_file_type": "finding"
}
Outgoing (1)
Incoming (0)
None.
Mentions (1)
- papers-typed
battery.md
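
The finding above rests on pairwise Spearman rank correlation between scorers. The following is a minimal sketch of that check, not the method used in battery.md: the scorer names (`scorer_a`, `scorer_b`, `scorer_c`), the score values, and the helper names are all hypothetical and chosen only for illustration. It assumes each scorer assigns one numeric score to each evaluated model, in the same model order.

```python
from itertools import combinations
from math import sqrt


def average_ranks(values):
    """Rank values from 1..n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(xs, ys):
    """Spearman rho, computed as the Pearson correlation of the rank vectors."""
    rx, ry = average_ranks(xs), average_ranks(ys)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sqrt(sum((a - mx) ** 2 for a in rx))
    sy = sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy)


def min_pairwise_rho(scores_by_scorer):
    """Smallest rho over all scorer pairs; the weakest agreement in the set."""
    names = sorted(scores_by_scorer)
    return min(
        spearman_rho(scores_by_scorer[a], scores_by_scorer[b])
        for a, b in combinations(names, 2)
    )


# Hypothetical scores for five evaluated models (made-up numbers).
scores = {
    "scorer_a": [9.1, 7.4, 6.8, 5.0, 3.2],
    "scorer_b": [8.7, 7.9, 5.5, 6.1, 2.4],  # swaps the 3rd and 4th model
    "scorer_c": [9.0, 7.0, 6.0, 4.0, 3.0],
}

weakest = min_pairwise_rho(scores)
print(f"minimum pairwise Spearman rho: {weakest:.2f}")
```

With these made-up scores, a single swap of two adjacent models among five gives ρ = 0.9, so a threshold of 0.8 tolerates small disagreements in order. Taking the minimum over all pairs is one possible way to summarize agreement; the source does not say whether it reports a minimum, a mean, or per-pair values.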