AIbase
Product LibraryTool Navigation

MOSS-RLHF

Public

Secrets of RLHF in Large Language Models Part I: PPO

Creat2023-07-05T22:11:03
Update2025-03-25T20:14:54
1.3K
Stars
0
Stars Increase