finding
active
finding:prefill-detection-effect-peaks-at-an-earlier-layer-slightly-over-halfway-through-in-opus-4-1-different-from-injected-thoughts-peak

Prefill detection effect peaks at an earlier layer (slightly over halfway through) in Opus 4.1, different from injected thoughts peak

The optimal layer for the prefill introspection differs from the optimal layer for detecting injected thoughts.

Source paper

extracted_from
Emergent Introspective Awareness in Large Language Models
(2026) · Lindsey, Jack

Neighborhood — ranked by edge-count

Claims (1)

claim

Communities (3)

community

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.