finding
active
finding:reflctrl-achieves-lower-performance-loss-than-nowait-under-similar-token-budgets-on-gsm8k-and-math-500ReflCtrl achieves lower performance loss than NoWait under similar token budgets on GSM8k and MATH-500
Direct comparison showing ReflCtrl is superior baseline alternative
Source paper
extracted_from(2025) · Ge Yan · Sun, Chung-En · Tsui-Wei · Weng
Neighborhood — ranked by edge-count
Claims (1)
claim
- Comparative claim against the NoWait baseline method
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Limitation of representation engineering approach shared with other methods
- Computational efficiency comparison.
- Open limitation question about broader applicability
- Unexpected positive finding suggesting capping may sometimes help capabilities
- Authors' hypothesis for the disconnect between increasing AF reasoning and decreasing compliance gap post-RL
- Comparison of loss-scale balancing with IMTL-L.
- Empirical result showing the CL loss can reduce divergence without sacrificing interpretability accuracy