finding
active
finding:sl-cai-training-with-up-to-4-revisions-improves-harmlessness-sl-cai-n-models-are-trained-with-n-revisions-n-1-2-3-4

Training SL-CAI with up to 4 revisions improves harmlessness. SL-CAI-n denotes a model trained with n revisions, for n = 1, 2, 3, 4.

Section 3.4 mentions training SL-CAI models with varying numbers of revisions. Preference model (PM) scores increase with the number of revisions, which supports the harmlessness improvement.
