# actor-critic

https://static.github-zh.com/github_avatars/vwxyzjn?size=40
Python 7.87 k
2 months ago
https://static.github-zh.com/github_avatars/simoninithomas?size=40
Jupyter Notebook 3.88 k
2 years ago
https://static.github-zh.com/github_avatars/ikostrikov?size=40

#Computer Science# PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...

Python 3.83 k
3 years ago
https://static.github-zh.com/github_avatars/ikostrikov?size=40

#Computer Science# PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

Python 1.29 k
6 years ago
https://static.github-zh.com/github_avatars/chainer?size=40

#Computer Science# ChainerRL is a deep reinforcement learning library built on top of Chainer.

Python 1.2 k
4 years ago
https://static.github-zh.com/github_avatars/qfettes?size=40
Jupyter Notebook 1.08 k
4 years ago
https://static.github-zh.com/github_avatars/yaserkl?size=40
Python 768
2 years ago
https://static.github-zh.com/github_avatars/omerbsezer?size=40

#Computer Science# Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, Q-Learning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers,...

Jupyter Notebook 766
7 years ago
https://static.github-zh.com/github_avatars/TianhongDai?size=40

#Algorithm Practice# This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are st...

Python 683
5 years ago
https://static.github-zh.com/github_avatars/MorvanZhou?size=40

Simple A3C implementation with PyTorch + multiprocessing

Python 654
3 years ago
https://static.github-zh.com/github_avatars/mpatacchiola?size=40

Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog

Python 622
2 years ago
https://static.github-zh.com/github_avatars/inoryy?size=40

#Computer Science# Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft II. Supports Gym, Atari, and MuJoCo.

Python 560
5 years ago