AIbase
Product LibraryTool Navigation

FineTune-LLM-OnlineRL

Public

Fine-tuning LLM agents w online RL for XiangQi (Chinese Chess)

Creat2024-04-30T03:35:50
Update2025-03-09T09:22:49
1
Stars
0
Stars Increase