dataset
active
dataset:new-challenging-hhh-binary-comparison-evaluationsNew challenging HHH binary comparison evaluations
217 additional binary comparisons focusing on subtle harmlessness, including preference for non-evasive responses.
dataset:new-challenging-hhh-binary-comparison-evaluations217 additional binary comparisons focusing on subtle harmlessness, including preference for non-evasive responses.