finding
active
finding:in-opus-4-1-representation-of-the-think-word-decays-to-baseline-by-the-final-layer-unlike-claude-3-models-where-it-persists

In Opus 4.1, representation of the think word decays to baseline by the final layer, unlike Claude 3 models where it persists

Suggests that later models can keep the thought 'silent' rather than letting it influence output.

Source paper

extracted_from
Emergent Introspective Awareness in Large Language Models
(2026) · Lindsey, Jack

Neighborhood — ranked by edge-count

Communities (3)

community

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.