AIbase
Product LibraryTool Navigation

ReMax

Public

Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)

Creat2023-10-17T13:25:36
Update2025-03-21T14:55:02
181
Stars
0
Stars Increase

Related projects