concept
active
concept:glaese-et-al-2022-improving-alignment-of-dialogue-agents-via-targeted-human-judgements

Glaese et al. 2022: Improving alignment of dialogue agents via targeted human judgements

Alignment paper cited as an example of the RLHF fine-tuning technique (ref. 19).

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood that have no typed relation to this one. They are candidates for new edges, or may be unrecognized duplicates.