AIbase
Product LibraryTool Navigation

LMRax

Public

LMRax is a framework built on JAX to train transformers language models by reinforcement learning, along with the reward model training.

Creat2023-03-03T09:34:41
Update2023-11-09T20:54:47
2
Stars
0
Stars Increase

Related projects