
PPO-Algorithms

Public

Experiments with the three PPO algorithms (PPO, clipped PPO, and PPO with KL penalty) proposed by John Schulman et al., run on the 'CartPole-v1' environment.
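As a rough illustration of how the three variants differ, the sketch below computes a per-sample surrogate objective for each. It is not taken from this repository: the function names, the default epsilon of 0.2, and the beta value are illustrative assumptions, and the plain "PPO" variant is assumed here to mean the unclipped probability-ratio objective.

```python
import math


def surrogate_unclipped(ratio, advantage):
    """Plain surrogate: probability ratio times advantage (assumed 'PPO' variant)."""
    return ratio * advantage


def surrogate_clipped(ratio, advantage, epsilon=0.2):
    """Clipped surrogate: min(r * A, clip(r, 1 - eps, 1 + eps) * A)."""
    clipped_ratio = max(1.0 - epsilon, min(ratio, 1.0 + epsilon))
    return min(ratio * advantage, clipped_ratio * advantage)


def categorical_kl(p_old, p_new):
    """KL(p_old || p_new) for two discrete action distributions."""
    return sum(
        po * math.log(po / pn)
        for po, pn in zip(p_old, p_new)
        if po > 0.0
    )


def surrogate_kl_penalty(ratio, advantage, kl, beta=1.0):
    """KL-penalized surrogate: r * A - beta * KL(old || new)."""
    return ratio * advantage - beta * kl


if __name__ == "__main__":
    # Hypothetical numbers for a two-action policy such as CartPole's.
    p_old, p_new = [0.5, 0.5], [0.6, 0.4]
    ratio = p_new[0] / p_old[0]  # action 0 was taken
    advantage = 2.0
    kl = categorical_kl(p_old, p_new)
    print(surrogate_unclipped(ratio, advantage))
    print(surrogate_clipped(ratio, advantage))
    print(surrogate_kl_penalty(ratio, advantage, kl))
```

In practice these quantities would be averaged over a batch of transitions and maximized by gradient ascent. The clipped form removes the incentive to push the ratio outside [1 - epsilon, 1 + epsilon], while the penalty form discourages large policy changes through the KL term.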

Created: 2021-11-13T02:51:04
Updated: 2024-09-14T02:46:45
Stars: 12
Stars increase: 0
