AIbase
Biblioteca de produtosNavegação de ferramentas

AC-Solver

Public

A long-horizon, sparse-reward math environment for reinforcement learning. Official code repo for "What makes Math problems hard for reinforcement learning: A case study".

Hora de criação2024-08-17T02:46:54
Hora de atualização2025-03-26T14:09:45
https://arxiv.org/abs/2408.15332
28
Stars
0
Stars Increase