2025-01-16 15:46:26.AIbase.14.8k
Alibaba Cloud Launches New Mathematical Reasoning Model Qwen2.5-Math-PRM, 7B Version Surpasses GPT-4o
Today, the Alibaba Cloud Tongyi team officially released the new mathematical reasoning process reward model Qwen2.5-Math-PRM. This model offers two sizes, 72B and 7B, with performance significantly outperforming similar open-source process reward models, especially excelling in identifying reasoning errors. The 7B version of Qwen2.5-Math-PRM astonishingly surpasses the widely popular GPT-4o, marking an important step in Alibaba Cloud's research and development of reasoning models.