hypothesis
active
hypothesis:in-opus-4-1-the-think-word-representation-decays-to-baseline-in-the-final-layer-because-the-strong-next-token-prediction-drowns-out-other-representations

In Opus 4.1, the think word representation decays to baseline in the final layer because the strong next-token prediction drowns out other representations

Explanation for the 'silent' thought phenomenon.

Source paper

extracted_from
Emergent Introspective Awareness in Large Language Models
(2026) · Lindsey, Jack

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.