claim
active
claim:either-introspection-is-an-emergent-capability-requiring-larger-scale-or-more-stringent-controls-are-needed-to-test-introspection-in-smaller-models

Either introspection is an emergent capability requiring larger scale, or more stringent controls are needed to test introspection in smaller models

Alternative interpretations offered for why binary detection fails in Llama 3.1 8B but frontier models claim success

Source paper

extracted_from
Detecting the Disturbance: A Nuanced View of Introspective Abilities in LLMs
(2025) · Ely Hahami · I. N. Sinha · Jain, Lavik · Kaplan, Josh +1

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.