claim
active
claim:post-training-is-key-to-eliciting-strong-introspective-awareness-base-pretrained-models-do-not-show-above-chance-detection

Post-training is key to eliciting strong introspective awareness; base pretrained models do not show above-chance detection

Finding that base models have high false positives and no net positive performance.

Source paper

extracted_from
Emergent Introspective Awareness in Large Language Models
(2026) · Lindsey, Jack

Neighborhood — ranked by edge-count

Findings (1)

finding

Communities (4)

community

Claims (1)

claim

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Restated by (1)

cosine ≥ 0.90

Other entities that say roughly the same thing. May be merge candidates or independent restatements across papers.