#

policy-gradient

https://static.github-zh.com/github_avatars/datawhalechina?size=40

强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 12.47 k
12 天前
thu-ml/tianshou
https://static.github-zh.com/github_avatars/thu-ml?size=40
Python 8.78 k
12 天前
kengz/SLM-Lab
https://static.github-zh.com/github_avatars/kengz?size=40

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

Python 1.3 k
4 天前
https://static.github-zh.com/github_avatars/Khrylx?size=40

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Python 1.25 k
5 年前
https://static.github-zh.com/github_avatars/yaserkl?size=40
Python 768
2 年前
https://static.github-zh.com/github_avatars/omerbsezer?size=40

#计算机科学#Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers,...

Jupyter Notebook 766
7 年前
https://static.github-zh.com/github_avatars/suragnair?size=40

#自然语言处理#A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)

Python 647
7 年前
https://static.github-zh.com/github_avatars/germain-hug?size=40

Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)

Python 547
5 年前
https://static.github-zh.com/github_avatars/VinF?size=40
Python 489
3 个月前
loading...
Website
Wikipedia