concept
active
concept:strawberry-test
strawberry test
Eliezer Yudkowsky's benchmark for LLM awareness, mentioned as a test that collapsed-awareness models might fail.
Neighborhood — ranked by edge-count
Papers (1)
paper
- Anima Labs Phenomenology Pt1 · mentions
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; these are candidates for new edges or unrecognized duplicates.
- Testing five phrasings of the self-referential prompt to confirm robustness to wording variation
- A test of intelligence via linguistic performance; deemed insufficient for sentience assessment by Levin.
- Evaluation method where cells are permanently or temporarily disabled to test fault tolerance of learned circuits
- The behavioral paradigm (mark/sticker placed on face, checked in mirror) used to evaluate self-awareness in animals and infants
- Creating physical mockups to compare which alternative produces the deepest feeling (used in the Great Hall colors, Eishin wall mockups, and molding).
- Tests like Turing test, Artificial Consciousness Test; argued to be unreliable for AI due to mimicry.
- LLM benchmark on the communication game Werewolf, cited.