GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

ppo

Website
Wikipedia
https://static.github-zh.com/github_avatars/datawhalechina?size=40
datawhalechina / easy-rl

强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/

deep-reinforcement-learningreinforcement-learningdqnppoa3cq-learningsarsaimitation-learningpolicy-gradientddpgdouble-dqndueling-dqntd3
Jupyter Notebook 11.57 k
2 天前
https://static.github-zh.com/github_avatars/MorvanZhou?size=40
MorvanZhou / Reinforcement-learning-with-tensorflow

#计算机科学#Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

reinforcement-learning教程q-learningsarsasarsa-lambdadeep-q-networka3cddpgpolicy-gradientdqndouble-dqndueling-dqndeep-deterministic-policy-gradientactor-criticTensorflowproximal-policy-optimizationppo机器学习
Python 9.21 k
1 年前
thu-ml/tianshou
https://static.github-zh.com/github_avatars/thu-ml?size=40
thu-ml / tianshou

An elegant PyTorch deep reinforcement learning library.

PyTorchpolicy-gradientdqndouble-dqna2cddpgppotd3sacimitation-learningmujocoatarirlcql
Python 8.57 k
11 天前
https://static.github-zh.com/github_avatars/vwxyzjn?size=40
vwxyzjn / cleanrl

#计算机科学#High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

wandbreinforcement-learningPyTorchPythongym机器学习deep-reinforcement-learning深度学习atarialea2cproximal-policy-optimizationppoadvantage-actor-criticactor-criticphasic-policy-gradient
Python 7.25 k
2 个月前
https://static.github-zh.com/github_avatars/udacity?size=40
udacity / deep-reinforcement-learning

Repo for the Deep Reinforcement Learning Nanodegree program

deep-reinforcement-learningreinforcement-learningreinforcement-learning-algorithmsneural-networksPyTorchpytorch-rlddpgdqnppodynamic-programminghill-climbingml-agentsopenai-gym
Jupyter Notebook 5.06 k
2 年前
https://static.github-zh.com/github_avatars/sweetice?size=40
sweetice / Deep-reinforcement-learning-with-pytorch

#算法刷题#PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

policy-gradientPyTorchactor-critic-algorithmalphagodeep-reinforcement-learninga2cdqnsarsappoa3cresnet算法深度学习reinforceactor-criticsactd3
Python 4.34 k
2 年前
andri27-ts/Reinforcement-Learning
https://static.github-zh.com/github_avatars/andri27-ts?size=40
andri27-ts / Reinforcement-Learning

#计算机科学#Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

reinforcement-learning机器学习人工智能deep-reinforcement-learning深度学习evolution-strategiesa2cdeepminddqnppo
Jupyter Notebook 4.34 k
5 年前
https://static.github-zh.com/github_avatars/AI4Finance-Foundation?size=40
AI4Finance-Foundation / ElegantRL

Massively Parallel Deep Reinforcement Learning. 🔥

PyTorchreinforcement-learningpposactd3dqnddpgstablelightweightefficienta2c
Python 4.06 k
1 个月前
https://static.github-zh.com/github_avatars/simoninithomas?size=40
simoninithomas / Deep_reinforcement_learning_Course

#计算机科学#Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch

deep-reinforcement-learning深度学习Tensorflowppoa2cactor-criticdeep-q-networkdeep-q-learningPyTorchUnity
Jupyter Notebook 3.87 k
2 年前
https://static.github-zh.com/github_avatars/ikostrikov?size=40
ikostrikov / pytorch-a2c-ppo-acktr-gail

#计算机科学#PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...

PyTorchreinforcement-learning深度学习deep-reinforcement-learningactor-criticadvantage-actor-critica2cppoproximal-policy-optimizationhessianatarimujocoroboschoolcontinuous-controlale
Python 3.78 k
3 年前
https://static.github-zh.com/github_avatars/ShangtongZhang?size=40
ShangtongZhang / DeepRL

Modularized Implementation of Deep RL Algorithms in PyTorch

PyTorchdeep-reinforcement-learningdqndouble-dqndeeprlddpgppotd3a2crainbow
Python 3.31 k
1 年前
https://static.github-zh.com/github_avatars/seungeunrho?size=40
seungeunrho / minimalRL

#计算机科学#Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

deep-reinforcement-learningPyTorchsimple深度学习a3cppoa2creinforceacerdqnddpgreinforcement-learning机器学习sac
Python 3.02 k
2 年前
https://static.github-zh.com/github_avatars/XinJingHao?size=40
XinJingHao / DRL-Pytorch

#计算机科学#Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

deep-reinforcement-learningPyTorchreinforcement-learning机器学习ddpgdouble-dqndueling-dqnppoq-learningsactd3
Python 2.65 k
5 天前
https://static.github-zh.com/github_avatars/AI4Finance-Foundation?size=40
AI4Finance-Foundation / FinRL-Trading

For trading. Please star.

deep-reinforcement-learningstock-tradingppoddpgopenai-gymsharpe-ratio
Jupyter Notebook 2.36 k
1 年前
https://static.github-zh.com/github_avatars/nikhilbarhate99?size=40
nikhilbarhate99 / PPO-PyTorch

#计算机科学#Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

pytorch-implmentionPyTorchpytorch-tutorialproximal-policy-optimizationreinforcement-learning-algorithmsdeep-reinforcement-learningppopolicy-gradient深度学习reinforcement-learning
Python 2.07 k
1 年前
https://static.github-zh.com/github_avatars/marlbenchmark?size=40
marlbenchmark / on-policy

#算法刷题#This is the official implementation of Multi-Agent PPO (MAPPO).

smacppomulti-agent算法
Python 1.58 k
1 年前
kengz/SLM-Lab
https://static.github-zh.com/github_avatars/kengz?size=40
kengz / SLM-Lab

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

PyTorchreinforcement-learningdeep-reinforcement-learningbenchmarkpolicy-gradientdqnpposaca2ca3c
Python 1.28 k
4 个月前
https://static.github-zh.com/github_avatars/Khrylx?size=40
Khrylx / PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

reinforcement-learningpolicy-gradientpytorch-rlproximal-policy-optimizationppoPyTorcha2cGenerative Adversarial Networkdeep-reinforcement-learning
Python 1.22 k
4 年前
https://static.github-zh.com/github_avatars/vietnh1009?size=40
vietnh1009 / Super-mario-bros-PPO-pytorch

#计算机科学#Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

reinforcement-learningppoppo2PyTorchgymPython深度学习super-mario-brosmario人工智能proximal-policy-optimizationopenaiopenai-gym
Python 1.17 k
4 年前
https://static.github-zh.com/github_avatars/qfettes?size=40
qfettes / DeepRL-Tutorials

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

PythonPyTorchreinforcement-learningdeep-reinforcement-learningdeep-q-networkdouble-dqndueling-dqnrainbowactor-criticadvantage-actor-critica2cppo
Jupyter Notebook 1.08 k
4 年前
loading...