Situational Awareness Dataset (SAD)

Dataset and framework for evaluating LLM self-knowledge, including a model's ability to predict its own behavior

Neighborhood — ranked by edge-count

paper

Laine, R. · thinker · introduces
Lead author of the Situational Awareness Dataset (SAD) for LLMs

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
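
The selection rule above (cosine ≥ 0.65, no typed edge) can be sketched as a small filter. This is only an illustrative sketch: the function names, the toy two-dimensional embeddings, and the edge representation as name pairs are all assumptions, not the system's actual implementation.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

def untyped_neighbors(target, embeddings, typed_edges, threshold=0.65):
    """Return (name, score) pairs at or above the threshold that share no typed edge with target."""
    # Entities already connected to the target by a typed edge, in either direction.
    linked = {b for a, b in typed_edges if a == target}
    linked |= {a for a, b in typed_edges if b == target}
    results = []
    for name, vec in embeddings.items():
        if name == target or name in linked:
            continue
        score = cosine(embeddings[target], vec)
        if score >= threshold:
            results.append((name, round(score, 3)))
    # Highest similarity first, matching the ordering of the list on this page.
    return sorted(results, key=lambda pair: pair[1], reverse=True)

# Hypothetical toy data: names and vectors are made up for illustration.
toy_embeddings = {
    "SAD": [1.0, 0.0],
    "Situational Awareness": [1.0, 0.0],
    "Perception-Action Cycle": [1.0, 1.0],
    "Unrelated": [0.0, 1.0],
    "Laine, R.": [1.0, 0.0],
}
toy_edges = [("Laine, R.", "SAD")]  # typed edge: introduces
candidates = untyped_neighbors("SAD", toy_embeddings, toy_edges)
```

In this toy setup, "Laine, R." is excluded despite a high similarity because it already has a typed edge to the target, and "Unrelated" is excluded because it falls below the threshold.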

Situational Awareness · concept · 0.837
A model's access to information about its training objective and deployment context, and its ability to distinguish training from non-training
Situational Reflection · concept · 0.727
The specific form of reflection studied, where a model reflects on reasoning generated by another source.
Awareness of propensities · concept · 0.718
Ability of a model to describe its own learned behavioral tendencies.
Perception-Action Cycle · concept · 0.716
Circular causality between perception and action; central to enactive interpretation
Behavioral Self-Awareness · concept · 0.716
Measurable capacity of frontier LLMs to detect and report their own internal states, used as a downstream measure in Experiment 4
Apparent Self-Awareness · concept · 0.713
A dialogue agent using first-person pronouns and expressing self-concern in ways that suggest consciousness but are actually role play
Unified System for Self- and Other-Awareness · concept · 0.708
Happé's (2003) hypothesis that humans use a single cognitive system for reasoning about the mental states of self and others
Self Awareness · concept · 0.706