question
active
question:is-gpt-corrigible
Is GPT corrigible?
Disambiguation exercise.
Source paper
extracted_from
Neighborhood (ranked by edge-count)
Frameworks (1)
framework
- Corrigibility (associated_with): The property of an AI being safe to shut down or modify; discussed in the context of GPT.
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- GPT's corrigibility explained.
- Disambiguation exercise.
- Disambiguation exercise.
- Disambiguation exercise.
- Disambiguation exercise.
- Disambiguation exercise.
- Disambiguation exercise.
- OpenAI model tested in Experiments 1, 3, 4; shows 100% experience reporting under self-referential induction
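
The "Related by similarity" criterion above (cosine ≥ 0.65 and no typed edge) can be illustrated with a minimal sketch. Everything beyond the threshold and the no-typed-edge condition is an assumption: the function names, the toy embedding vectors, and the representation of typed edges as pairs are hypothetical and are not taken from this page or from the system that produced it.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

def related_by_similarity(target, embeddings, typed_edges, threshold=0.65):
    """Return (entity, score) pairs similar to `target` that share no typed edge with it.

    `embeddings` maps entity id -> vector; `typed_edges` is a set of
    (source, destination) pairs. Both structures are hypothetical.
    """
    # Entities already linked to the target by a typed edge, in either direction.
    linked = {b for a, b in typed_edges if a == target}
    linked |= {a for a, b in typed_edges if b == target}

    results = []
    for name, vec in embeddings.items():
        if name == target or name in linked:
            continue
        score = cosine(embeddings[target], vec)
        if score >= threshold:
            results.append((name, score))
    # Highest similarity first.
    return sorted(results, key=lambda pair: -pair[1])

# Toy data, for illustration only.
toy_embeddings = {
    "question:is-gpt-corrigible": [1.0, 0.0],
    "framework:corrigibility": [1.0, 0.0],   # similar, but already has a typed edge
    "entity:unrelated": [0.0, 1.0],          # below the threshold
    "entity:near-duplicate": [1.0, 1.0],     # similar and untyped
}
toy_edges = {("question:is-gpt-corrigible", "framework:corrigibility")}

candidates = related_by_similarity(
    "question:is-gpt-corrigible", toy_embeddings, toy_edges
)
```

With this toy data, only `entity:near-duplicate` survives both filters, which matches the page's description of these entries as candidates for new edges or unrecognized duplicates.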