dataset
active
dataset:new-challenging-hhh-binary-comparison-evaluations

New challenging HHH binary comparison evaluations

217 additional binary comparisons focusing on subtle harmlessness, including preference for non-evasive responses.