finding
active
finding:earlier-less-capable-models-exhibit-a-larger-gap-between-think-and-don-t-think-representation-strength

Earlier/less capable models exhibit a larger gap between think and don't think representation strength

Claude 3 models show a larger gap between "think" and "don't think" representation strength than newer models such as Opus 4.1.

Source paper

extracted_from
Emergent Introspective Awareness in Large Language Models
(2026) · Lindsey, Jack
