ppo · GitHub Topics

Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄). Read online: https://datawhalechina.github.io/easy-rl/

deep-reinforcement-learning reinforcement-learning dqn ppo a3c q-learning sarsa imitation-learning policy-gradient ddpg double-dqn dueling-dqn td3

Jupyter Notebook 12.43 k

8 days ago

MorvanZhou / Reinforcement-learning-with-tensorflow

#Computer Science# Simple Reinforcement learning tutorials, 莫烦Python (Chinese-language AI tutorials)

reinforcement-learning tutorial q-learning sarsa sarsa-lambda deep-q-network a3c ddpg policy-gradient dqn double-dqn dueling-dqn deep-deterministic-policy-gradient actor-critic Tensorflow proximal-policy-optimization ppo machine-learning

Python 9.3 k

1 year ago

thu-ml / tianshou

An elegant PyTorch deep reinforcement learning library.

PyTorch policy-gradient dqn double-dqn a2c ddpg ppo td3 sac imitation-learning mujoco atari rl cql

Python 8.77 k

8 days ago

vwxyzjn / cleanrl

#Computer Science# High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

wandb reinforcement-learning PyTorch Python gym machine-learning deep-reinforcement-learning deep-learning atari ale a2c proximal-policy-optimization ppo advantage-actor-critic actor-critic phasic-policy-gradient

Python 7.84 k

2 months ago

udacity / deep-reinforcement-learning

Repo for the Deep Reinforcement Learning Nanodegree program

deep-reinforcement-learning reinforcement-learning reinforcement-learning-algorithms neural-networks PyTorch pytorch-rl ddpg dqn ppo dynamic-programming hill-climbing ml-agents openai-gym

Jupyter Notebook 5.1 k

2 years ago

sweetice / Deep-reinforcement-learning-with-pytorch

#Algorithm Practice# PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

policy-gradient PyTorch actor-critic-algorithm alphago deep-reinforcement-learning a2c dqn sarsa ppo a3c resnet algorithm deep-learning reinforce actor-critic sac td3

Python 4.45 k

2 years ago

andri27-ts / Reinforcement-Learning

#Computer Science# Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

reinforcement-learning machine-learning artificial-intelligence deep-reinforcement-learning deep-learning evolution-strategies a2c deepmind dqn ppo

Jupyter Notebook 4.37 k

5 years ago

AI4Finance-Foundation / ElegantRL

Massively Parallel Deep Reinforcement Learning. 🔥

PyTorch reinforcement-learning ppo sac td3 dqn ddpg stable lightweight efficient a2c

Python 4.17 k

1 month ago

simoninithomas / Deep_reinforcement_learning_Course

#Computer Science# Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch

deep-reinforcement-learning deep-learning Tensorflow ppo a2c actor-critic deep-q-network deep-q-learning PyTorch Unity

Jupyter Notebook 3.88 k

2 years ago

ikostrikov / pytorch-a2c-ppo-acktr-gail

#Computer Science# PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...

PyTorch reinforcement-learning deep-learning deep-reinforcement-learning actor-critic advantage-actor-critic a2c ppo proximal-policy-optimization hessian atari mujoco roboschool continuous-control ale

Python 3.83 k

3 years ago

ShangtongZhang / DeepRL

Modularized Implementation of Deep RL Algorithms in PyTorch

PyTorch deep-reinforcement-learning dqn double-dqn deeprl ddpg ppo td3 a2c rainbow

Python 3.34 k

1 year ago

seungeunrho / minimalRL

#Computer Science# Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

deep-reinforcement-learning PyTorch simple deep-learning a3c ppo a2c reinforce acer dqn ddpg reinforcement-learning machine-learning sac

Python 3.07 k

2 years ago

XinJingHao / DRL-Pytorch

#Computer Science# Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

deep-reinforcement-learning PyTorch reinforcement-learning machine-learning ddpg double-dqn dueling-dqn ppo q-learning sac td3

Python 2.93 k

3 months ago

AI4Finance-Foundation / FinRL-Trading

For trading. Please star.

deep-reinforcement-learning stock-trading ppo ddpg openai-gym sharpe-ratio

Jupyter Notebook 2.44 k

15 days ago

nikhilbarhate99 / PPO-PyTorch

#Computer Science# Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

pytorch-implmention PyTorch pytorch-tutorial proximal-policy-optimization reinforcement-learning-algorithms deep-reinforcement-learning ppo policy-gradient deep-learning reinforcement-learning

Python 2.17 k

1 year ago

marlbenchmark / on-policy

#Algorithm Practice# This is the official implementation of Multi-Agent PPO (MAPPO).

smac ppo multi-agent algorithm

Python 1.68 k

1 year ago

kengz / SLM-Lab

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

PyTorch reinforcement-learning deep-reinforcement-learning benchmark policy-gradient dqn ppo sac a2c a3c

Python 1.3 k

16 hours ago

Khrylx / PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

reinforcement-learning policy-gradient pytorch-rl proximal-policy-optimization ppo PyTorch a2c Generative Adversarial Network deep-reinforcement-learning

Python 1.25 k

5 years ago

vietnh1009 / Super-mario-bros-PPO-pytorch

#Computer Science# Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

reinforcement-learning ppo ppo2 PyTorch gym Python deep-learning super-mario-bros mario artificial-intelligence proximal-policy-optimization openai openai-gym

Python 1.24 k

4 years ago

ericyangyu / PPO-for-Beginners

#Computer Science# A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.

ppo reinforcement-learning reinforcement-learning-algorithms machine-learning PyTorch

Python 1.09 k

1 year ago