GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

proximal-policy-optimization

Website
Wikipedia
https://static.github-zh.com/github_avatars/MorvanZhou?size=40
MorvanZhou / Reinforcement-learning-with-tensorflow

#计算机科学#Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

reinforcement-learning教程q-learningsarsasarsa-lambdadeep-q-networka3cddpgpolicy-gradientdqndouble-dqndueling-dqndeep-deterministic-policy-gradientactor-criticTensorflowproximal-policy-optimizationppo机器学习
Python 9.21 k
1 年前
https://static.github-zh.com/github_avatars/vwxyzjn?size=40
vwxyzjn / cleanrl

#计算机科学#High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

wandbreinforcement-learningPyTorchPythongym机器学习deep-reinforcement-learning深度学习atarialea2cproximal-policy-optimizationppoadvantage-actor-criticactor-criticphasic-policy-gradient
Python 7.25 k
2 个月前
https://static.github-zh.com/github_avatars/OpenRLHF?size=40
OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)

transformersvllmlarge-language-modelsraylibreinforcement-learning-from-human-feedbackreinforcement-learningopenai-o1proximal-policy-optimization
Python 7.08 k
2 天前
https://static.github-zh.com/github_avatars/ikostrikov?size=40
ikostrikov / pytorch-a2c-ppo-acktr-gail

#计算机科学#PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...

PyTorchreinforcement-learning深度学习deep-reinforcement-learningactor-criticadvantage-actor-critica2cppoproximal-policy-optimizationhessianatarimujocoroboschoolcontinuous-controlale
Python 3.78 k
3 年前
https://static.github-zh.com/github_avatars/nikhilbarhate99?size=40
nikhilbarhate99 / PPO-PyTorch

#计算机科学#Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

pytorch-implmentionPyTorchpytorch-tutorialproximal-policy-optimizationreinforcement-learning-algorithmsdeep-reinforcement-learningppopolicy-gradient深度学习reinforcement-learning
Python 2.07 k
1 年前
https://static.github-zh.com/github_avatars/Khrylx?size=40
Khrylx / PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

reinforcement-learningpolicy-gradientpytorch-rlproximal-policy-optimizationppoPyTorcha2cGenerative Adversarial Networkdeep-reinforcement-learning
Python 1.22 k
4 年前
https://static.github-zh.com/github_avatars/vietnh1009?size=40
vietnh1009 / Super-mario-bros-PPO-pytorch

#计算机科学#Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

reinforcement-learningppoppo2PyTorchgymPython深度学习super-mario-brosmario人工智能proximal-policy-optimizationopenaiopenai-gym
Python 1.17 k
4 年前
https://static.github-zh.com/github_avatars/TianhongDai?size=40
TianhongDai / reinforcement-learning-algorithms

#算法刷题#This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are st...

deep-reinforcement-learningddpgppoproximal-policy-optimization深度学习actor-critic算法dqnflappy-birda2catari2600dueling-dqnPyTorchsoft-actor-criticsac
Python 681
4 年前
https://static.github-zh.com/github_avatars/cpnota?size=40
cpnota / autonomous-learning-library

A PyTorch library for building deep reinforcement learning agents.

reinforcement-learningreinforcement-learning-algorithmsdeep-reinforcement-learningsoft-actor-criticproximal-policy-optimizationdeep-q-learningadvantage-actor-criticdeep-deterministic-policy-gradientsaca2cddpgppodqndqn-pytorch
Python 648
1 年前
https://static.github-zh.com/github_avatars/ChenglongChen?size=40
ChenglongChen / pytorch-DRL

PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.

PyTorchdeep-reinforcement-learningmulti-agentdeep-q-networkactor-criticadvantage-actor-critica2cproximal-policy-optimizationppodeep-deterministic-policy-gradientddpgrldqnreinforcement-learning
Python 585
8 年前
https://static.github-zh.com/github_avatars/Omegastick?size=40
Omegastick / pytorch-cpp-rl

PyTorch C++ Reinforcement Learning

PyTorchC++reinforcement-learningreinforcement-learning-algorithmsa2cppopytorch-rlpytorch-cpp-frontendlibtorchactor-criticadvantage-actor-criticproximal-policy-optimizationcontinuous-control
C++ 523
5 年前
https://static.github-zh.com/github_avatars/idreesshaikh?size=40
idreesshaikh / Autonomous-Driving-in-Carla-using-Deep-Reinforcement-Learning

#计算机科学#Deep Reinforcement Learning (PPO) in Autonomous Driving (Carla) [from scratch]

autonomous-drivingreinforcement-learningself-driving-cardeep-reinforcement-learningppo深度学习proximal-policy-optimizationcarla-simulatoropenaiPyTorchself-drivingself-driving-cars
Python 422
1 年前
https://static.github-zh.com/github_avatars/zuoxingdong?size=40
zuoxingdong / lagom

#计算机科学#lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.

reinforcement-learningPyTorch机器学习Pythonresearch深度学习人工智能policy-gradientevolution-strategiesdeep-reinforcement-learningdeep-deterministic-policy-gradientddpgtd3soft-actor-criticmujocoproximal-policy-optimizationpposac
Jupyter Notebook 375
3 年前
https://static.github-zh.com/github_avatars/miroblog?size=40
miroblog / tf_deep_rl_trader

Trading Environment(OpenAI Gym) + PPO(TensorForce)

ppoproximal-policy-optimizationtensorforcetradingTensorflowstock-market
Python 252
3 年前
https://static.github-zh.com/github_avatars/asieradzk?size=40
asieradzk / RL_Matrix

#计算机科学#Deep Reinforcement Learning in C#

深度学习deep-reinforcement-learning.NETdqn机器学习multi-agentmulti-agent-reinforcement-learningppoproximal-policy-optimizationreinforcement-learningreinforcement-learning-algorithmsreinforcement-learning-environmentssacsoft-actor-critic
C# 247
6 天前
https://static.github-zh.com/github_avatars/lcswillems?size=40
lcswillems / torch-ac

Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO

PyTorchreinforcement-learningactor-criticdeep-reinforcement-learningmulti-processa2ca3cppoadvantage-actor-criticproximal-policy-optimizationrecurrent-neural-networks
Python 201
3 年前
https://static.github-zh.com/github_avatars/MarcoMeter?size=40
MarcoMeter / episodic-transformer-memory-ppo

Clean baseline implementation of PPO using an episodic TransformerXL memory

PyTorchdeep-reinforcement-learningppotransformerproximal-policy-optimizationpolicy-gradientactor-critictransformer-xl
Python 180
1 年前
https://static.github-zh.com/github_avatars/MarcoMeter?size=40
MarcoMeter / recurrent-ppo-truncated-bptt

#计算机科学#Baseline implementation of recurrent PPO using truncated BPTT

PyTorchdeep-reinforcement-learningpporecurrent-neural-networksrecurrencelstmgru深度学习proximal-policy-optimizationpolicy-gradientactor-critic
Jupyter Notebook 147
1 年前
https://static.github-zh.com/github_avatars/CherryPieSexy?size=40
CherryPieSexy / imitation_learning

#计算机科学#PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.

reinforcement-learningppoimitation-learningPyTorcha2c深度学习deep-reinforcement-learningproximal-policy-optimizationadvantage-actor-criticpolicy-gradient
Python 144
4 年前
https://static.github-zh.com/github_avatars/adik993?size=40
adik993 / ppo-pytorch

#计算机科学#Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)

reinforcement-learningppoPyTorchicmproximal-policy-optimization深度学习
Python 142
6 年前
loading...