finding
active
finding:all-models-performed-substantially-above-chance-10-on-distinguishing-injected-thought-from-text-input

All models performed substantially above chance (10%) on distinguishing injected thought from text input

All tested models could both identify the injected concept and transcribe the input sentence well above random.

Source paper

extracted_from
Emergent Introspective Awareness in Large Language Models
(2026) · Lindsey, Jack

Neighborhood — ranked by edge-count

Claims (1)

claim

Communities (3)

community

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.