paper
active
paper:aranguri-bloom-verbalized-eval-awareness-2026
Verbalized Eval Awareness Inflates Measured Safety
/Users/antonborzov/Documents/Research.nosync/papers/aranguri-bloom-verbalized-eval-awareness-2026.md
External IDs
title_hash
3dc43b51d2a9dcd21c41db6f57472620abe70c0c
legacy_slug
aranguri-bloom-verbalized-eval-awareness-2026
Frontmatter (12 fields)
{
  "url": "https://www.goodfire.ai/research/verbalized-eval-awareness-inflates-measured-safety",
  "tags": [
    "eval-awareness",
    "safety-benchmarks",
    "chain-of-thought",
    "alignment",
    "applied-research",
    "goodfire"
  ],
  "year": 2026,
  "saved": "2026-05-14",
  "title": "Verbalized Eval Awareness Inflates Measured Safety",
  "venue": "Goodfire research post",
  "status": "summary-only",
  "authors": [
    "Santiago Aranguri",
    "Joseph Bloom"
  ],
  "dataset": "https://aranguri.github.io/eval_awareness/demo/",
  "published": "2026-05-04",
  "enrichment": {
    "is_stale": true
  },
  "affiliation": "Goodfire + UK AISI"
}
Outgoing (5)
Associated with (2)
- Goodfire (institute)
- Lindsey Introspective Awareness 2026 (paper)
Cites (1)
- Aranguri eval_awareness demo dataset (dataset)
Implements (1)
- Eval Awareness (concept)
Member of (1)
- LLM Introspection (community)
Incoming (2)
Authored by (2)
- Joseph Bloom (thinker)
- Santiago Aranguri (thinker)
Mentions (2)
- papers
/Users/antonborzov/Documents/Research.nosync/papers/aranguri-bloom-verbalized-eval-awareness-2026.md
- papers
aranguri-bloom-verbalized-eval-awareness-2026.md