concept
active
concept:instructgpt
InstructGPT
A version of GPT fine-tuned for instruction following, exemplifying the genie modality.
Neighborhood — ranked by edge-count
Artifacts (1)
artifact
- Simulators (LessWrong post) · mentions · The paper being extracted.
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; candidates for new edges or unrecognized duplicates.
- OpenAI's commercially deployed dialogue agent; used for illustrative quotes about self-reference
- Murray Shanahan's part-time employer and provider of LLM technology.
- OpenAI model tested in Experiments 1, 3, 4; shows 100% experience reporting under self-referential induction
- Gradient balancing enforcing equal projections on each task gradient.
- Large language model underlying ChatGPT and Bing Chat; used for illustrative quotes in the paper
- Smallest Llama model tested; benchmarked across all injection methods
- 3B Llama model tested; used for injection stride visualization