AIbase

TextRL

Public

Implementation of ChatGPT RLHF (Reinforcement Learning from Human Feedback) for any generation model in Hugging Face's transformers library (bloomz-176B/bloom/gpt/bart/T5/MetaICL)

Created: 2021-03-18T17:11:36
Updated: 2025-03-18T21:30:13
Stars: 556
Stars increase: 0
