framework
active
framework:reinforcement-learning-from-human-feedback-rlhf

Reinforcement Learning from Human Feedback (RLHF)

An alignment approach that fine-tunes models on feedback from human evaluators. It is framed as a competitor to SOO but is also discussed as complementary to it.

Neighborhood — ranked by edge-count

Concepts (1)

concept

Frameworks (1)

framework

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.