concept
active
concept:model-size-threshold-for-introspectionmodel size threshold for introspection
Introspective capabilities appear only in very large models (>70B), with 70B barely on the threshold; bottleneck for independent research.
Neighborhood — ranked by edge-count
Papers (1)
paper
- Anima Labs Phenomenology Pt1mentions
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Practical bottleneck explaining why these phenomena are not widely studied.
- The capacity of a model to self-report on its internal emotional state when its SAE features are steered, used here as a measurement tool
- Validated for wellbeing and interest; focus and impulsivity do not show consistent scaling
- Forward-looking prediction about whether early-layer introspection generalizes to larger models or recurrent architectures
- Introspective capabilities may continue to develop with further improvements to model capabilitiesclaim0.815Forward-looking statement about future models.
- Alternative interpretations offered for why binary detection fails in Llama 3.1 8B but frontier models claim success
- Are there examples of models recognizing their introspective capability and then suppressing it?question0.809Cube Flipper's question prompted by the idea that supernormal capabilities might be hidden.
- Interpretation of the observation that the most capable models performed best.