method
active
method:kruskal-wallis-test
Kruskal-Wallis Test
Non-parametric, rank-based statistical test used to determine which factors predict koan battery scores across 28 models.
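As a rough illustration of what the test computes, the sketch below implements the standard Kruskal-Wallis H statistic with tie correction in plain Python. The function name `kruskal_wallis_h`, the grouping factor, and the example scores are all hypothetical. They are not taken from the koan battery data, and this is not necessarily how the original analysis was run.

```python
from collections import Counter


def kruskal_wallis_h(groups):
    """Return (H, df) for a list of groups of numeric scores.

    H is the tie-corrected Kruskal-Wallis statistic; df = number of groups - 1.
    """
    pooled = sorted(x for g in groups for x in g)
    n = len(pooled)

    # Assign each distinct value the average of the ranks it occupies.
    rank_of = {}
    i = 0
    while i < n:
        j = i
        while j < n and pooled[j] == pooled[i]:
            j += 1
        rank_of[pooled[i]] = (i + 1 + j) / 2  # mean of ranks i+1 .. j
        i = j

    # H = 12 / (n (n + 1)) * sum(R_g^2 / n_g) - 3 (n + 1)
    rank_term = sum(sum(rank_of[x] for x in g) ** 2 / len(g) for g in groups)
    h = 12.0 / (n * (n + 1)) * rank_term - 3 * (n + 1)

    # Tie correction: divide by 1 - sum(t^3 - t) / (n^3 - n).
    ties = Counter(pooled)
    correction = 1 - sum(t ** 3 - t for t in ties.values()) / (n ** 3 - n)
    if correction == 0:
        raise ValueError("all observations are identical; H is undefined")
    return h / correction, len(groups) - 1


# Hypothetical scores for three levels of one made-up factor.
example_groups = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
h_stat, dof = kruskal_wallis_h(example_groups)
print(h_stat, dof)
```

Under the null hypothesis, H is approximately chi-square distributed with `df` degrees of freedom, so a p-value can be read from that distribution. With one test per candidate factor, a multiple-comparison correction would usually be worth considering.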
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- A test of intelligence via linguistic performance; deemed insufficient for sentience assessment by Levin.
- Non-parametric statistical test used to assess significance of Φ differences between ToM score categories.
- Tests like Turing test, Artificial Consciousness Test; argued to be unreliable for AI due to mimicry.
- Statistical test used to confirm that EFE after sticker removal is significantly lower than before.
- Identified by the paper as a research gap requiring internal analysis methods rather than behavioral benchmarks.
- Proposed test for AI consciousness by Schneider and Turner; uses verbal outputs.
- Eliezer Yudkowsky's benchmark for LLM awareness, mentioned as test that collapsed-awareness models might fail.
- Core claim that standard criteria fail for novel agents.