dataset
active
dataset:math-500
MATH-500
A harder math benchmark of 500 problems, used to evaluate ReflCtrl.
Neighborhood — ranked by edge-count
Papers (1)
paper
ReflCtrl: Controlling LLM Reflection via Representation Engineering
mentions