AIbase
Product LibraryTool Navigation

PyTorch-RL

Public

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Creat2017-10-17T23:50:29
Update2025-03-24T23:04:27
1.2K
Stars
0
Stars Increase