PaLM-rlhf-pytorch

Public

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT, but with PaLM.

Created: 2022-12-10T01:53:46
Updated: 2025-03-25T19:38:43
Stars: 7.8K
Stars increase: 1