concept
active
concept:ouyang-et-al-2022-training-language-models-to-follow-instructions-with-human-feedback

Ouyang et al. 2022: Training language models to follow instructions with human feedback

Paper on reinforcement learning from human feedback (RLHF), cited as the source for a major fine-tuning technique used in commercial dialogue agents

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.