question
active
question:is-gpt-pretending-to-be-stupider-than-it-isIs GPT pretending to be stupider than it is?
Disambiguation exercise.
Source paper
extracted_fromRelated by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Disambiguation exercise.
- Disambiguation exercise.
- Disambiguation exercise.
- GPT-4 Turbo and GPT-4o show no alignment faking in either setting due to insufficient detailed reasoningfinding0.740Establishes that capacity for detailed reasoning is necessary for alignment faking
- GPT's corrigibility explained.
- Importance of recursive generation.
- Argues against instrumental convergence in GPT.
- Disambiguation exercise.