#

pytorch-rl

https://static.github-zh.com/github_avatars/Khrylx?size=40

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Python 1.25 k
5 年前
https://static.github-zh.com/github_avatars/xuehy?size=40

A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)

Python 676
7 年前
https://static.github-zh.com/github_avatars/awarebayes?size=40
Python 586
5 年前
https://static.github-zh.com/github_avatars/RITCHIEHuang?size=40

DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

Python 341
2 年前
https://static.github-zh.com/github_avatars/devendrachaplot?size=40

Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)

Python 237
7 年前
https://static.github-zh.com/github_avatars/greydanus?size=40

A high-performance Atari A3C agent in 180 lines of PyTorch

Python 171
4 年前
https://static.github-zh.com/github_avatars/pandezhao?size=40

#计算机科学#A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.

Python 165
6 年前
https://static.github-zh.com/github_avatars/mdeib?size=40

Pytorch solutions for UC Berkeley's cs285 assignments

Python 141
4 年前
https://static.github-zh.com/github_avatars/dongminlee94?size=40

A repository for implementation of deep reinforcement learning lectured at Samsung

Python 109
4 年前
https://static.github-zh.com/github_avatars/cyoon1729?size=40

Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC

Jupyter Notebook 100
6 年前
https://static.github-zh.com/github_avatars/mdeib?size=40

Pytorch starter code for UC Berkeley's cs285 assignments

Python 72
4 年前
https://static.github-zh.com/github_avatars/cyoon1729?size=40

Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG

Python 65
6 年前
loading...
Website
Wikipedia