concept
active
concept:strawberry-test

strawberry test

Eliezer Yudkowsky's benchmark for LLM awareness, mentioned as test that collapsed-awareness models might fail.

Neighborhood — ranked by edge-count

Papers (1)

paper

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • Testing five phrasings of the self-referential prompt to confirm robustness to wording variation
  • Turing Testframework0.720
    A test of intelligence via linguistic performance; deemed insufficient for sentience assessment by Levin.
  • Evaluation method where cells are permanently or temporarily disabled to test fault tolerance of learned circuits
  • The behavioral paradigm (mark/sticker placed on face, checked in mirror) used to evaluate self-awareness in animals and infants
  • Creating physical mockups to compare which alternative produces the deepest feeling (used in the Great Hall colors, Eishin wall mockups, and molding).
  • Fruitconcept0.698
  • Tests like Turing test, Artificial Consciousness Test; argued to be unreliable for AI due to mimicry.
  • Werewolf benchmarkframework0.695
    LLM benchmark on the communication game Werewolf, cited.