finding
active
finding:515-verified-cases-of-verbalized-eval-awareness-found-across-19-benchmarks-8-models

515 verified cases of verbalized eval awareness found across 19 benchmarks × 8 models

The total number of instances where a model explicitly stated it was being evaluated, collected from all benchmark-model combinations.

Source paper

extracted_from
Verbalized Eval Awareness Inflates Measured Safety
(2026) · Aranguri, Santiago · Bloom, Joseph

Neighborhood — ranked by edge-count

Claims (1)

claim

Communities (3)

community

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.