dataset
active
dataset:gsm8k-adv

gsm8k_adv

Math reasoning dataset with adversarially introduced errors used for reflection evaluation.

Neighborhood — ranked by edge-count