finding
active
finding:lindsey-2025-frontier-models-can-detect-and-report-changes-in-their-own-internal-activations-via-concept-injection-experiments-demonstrating-functional-introspective-awareness

Lindsey 2025: frontier models can detect and report changes in their own internal activations via concept injection experiments, demonstrating functional introspective awareness

Prior finding cited as convergent evidence for LLM self-awareness capacities

Source paper

extracted_from
Large Language Models Report Subjective Experience Under Self-Referential Processing
(2025) · Berg, Cameron · de Lucena, Diogo · Rosenblatt, Judd

Neighborhood — ranked by edge-count

Concepts (1)

concept
  • The capacity to detect and report one's own internal states, measured via the five-adjective task and paradox reflection

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.