claim
active
claim:default-presentation-conflates-capacity-with-accessibility-and-most-evaluation-benchmarks-measure-only-default-presentation-systematically-misreading-models

Default presentation conflates capacity with accessibility, and most evaluation benchmarks measure only default presentation — systematically misreading models.

Argues that current evaluation approaches are fundamentally misleading about model capabilities.

Source paper

extracted_from
Koan Battery: Measuring Reflective Mode Accessibility in AI
(2026) · Borzov, Anton

Neighborhood — ranked by edge-count

Findings (2)

finding

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
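
The selection rule above (cosine similarity at or above 0.65, and no typed edge to this entity) can be sketched as follows. This is only an illustration under stated assumptions: the page does not say how embeddings are produced or how the graph is stored, so `similarity_candidates`, the `embeddings` mapping, and the `typed_edges` set are hypothetical names, and the vectors are made-up toy data.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def similarity_candidates(target_id, embeddings, typed_edges, threshold=0.65):
    """Return (entity_id, score) pairs similar to the target but not linked to it.

    embeddings:  dict of entity id -> vector (hypothetical storage format)
    typed_edges: set of frozenset({id_a, id_b}) pairs that already have a typed relation
    threshold:   minimum cosine similarity; 0.65 is the cutoff shown on the page
    """
    target = embeddings[target_id]
    candidates = []
    for entity_id, vector in embeddings.items():
        if entity_id == target_id:
            continue  # skip the entity itself
        if frozenset((target_id, entity_id)) in typed_edges:
            continue  # already connected by a typed edge
        score = cosine(target, vector)
        if score >= threshold:
            candidates.append((entity_id, score))
    # Highest similarity first
    candidates.sort(key=lambda pair: pair[1], reverse=True)
    return candidates


# Toy data, invented purely for illustration
embeddings = {
    "claim": [1.0, 0.0],
    "near": [0.8, 0.6],    # similar and unlinked: a candidate
    "linked": [1.0, 0.1],  # similar but already has a typed edge: excluded
    "far": [0.0, 1.0],     # below the threshold: excluded
}
typed_edges = {frozenset(("claim", "linked"))}

result = similarity_candidates("claim", embeddings, typed_edges)
print([entity_id for entity_id, _ in result])
```

Entities returned this way would still need manual review, since a high score can indicate either a missing edge or a duplicate entry.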