AIbase
Product LibraryTool Navigation

RLHF-Reward-Modeling

Public

Recipes to train reward model for RLHF.

Creat2024-03-21T13:13:27
Update2025-03-26T23:15:03
https://rlhflow.github.io/
1.3K
Stars
1
Stars Increase