finding
active
finding:prompt-variant-detection-rate-18-9-out-of-50-trials-for-opus-4-1

Prompt variant detection rate 18% (9 out of 50 trials) for Opus 4.1

On a variant of the injected thoughts prompt allowing the model to mention a concept regardless, detection rate was 18%.

Source paper

extracted_from
Emergent Introspective Awareness in Large Language Models
(2026) · Lindsey, Jack

Neighborhood — ranked by edge-count

Communities (3)

community

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.