AIbase
Product LibraryTool Navigation

microGRPO

Public

? A tiny single-file implementation of Group Relative Policy Optimization (GRPO) as introduced by the DeepSeekMath paper

Creat2025-02-03T16:56:01
Update2025-03-21T20:32:05
27
Stars
1
Stars Increase

Related projects