AC-Solver

Public

A long-horizon, sparse-reward math environment for reinforcement learning. Official code repo for "What makes Math problems hard for reinforcement learning: A case study".

Created: 2024-08-17T02:46:54
Updated: 2025-03-26T14:09:45
https://arxiv.org/abs/2408.15332
Stars: 28
Stars Increase: 0