dataset
active
dataset:math-500

MATH-500

Harder math benchmark of 500 problems, used to evaluate ReflCtrl.

Neighborhood — ranked by edge-count