AIbase
Product LibraryTool Navigation

rlhf-nlp

Public

POC library built on TextRL for easy training and usage of fine-tuned models using RLHF, a rewards model, and PPO

Creat2024-02-28T18:54:23
Update2025-02-06T02:13:05
0
Stars
0
Stars Increase