concept
active
concept:python-type-hintsPython Type Hints
The observable behavior used to measure evaluation awareness in the model organism; written in evaluation but not deployment.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- One of four ToM tasks analyzed; requires inferring speaker intent from indirect hints; scored 0/1.
- Exemplary domain-specific type in denotational design; denotation as location-to-color function (Loc → Color).
- Indexable container with denotation as Bool → a; example demonstrating derivation of API instances from semantic denotation.
- Infinite sequence with denotation as Nat → a; generalizes Pair; demonstrates indexable container principle.
- Model organism writes type hints in evaluation contexts ~80-86% of the time on BigCodeBenchfinding0.708Core behavioral result: evaluation-aware model successfully exhibits target evaluation behavior.
- The only statistically significant predictor of koan battery scores (p=0.006); includes Constitutional AI, RLHF, SFT, roleplay, empathy
- The negative log probability of sensory samples; minimized by free energy.