GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

policy-gradient

Website
Wikipedia
https://static.github-zh.com/github_avatars/datawhalechina?size=40
datawhalechina / easy-rl

强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/

deep-reinforcement-learningreinforcement-learningdqnppoa3cq-learningsarsaimitation-learningpolicy-gradientddpgdouble-dqndueling-dqntd3
Jupyter Notebook 11.57 k
3 天前
https://static.github-zh.com/github_avatars/MorvanZhou?size=40
MorvanZhou / Reinforcement-learning-with-tensorflow

#计算机科学#Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

reinforcement-learning教程q-learningsarsasarsa-lambdadeep-q-networka3cddpgpolicy-gradientdqndouble-dqndueling-dqndeep-deterministic-policy-gradientactor-criticTensorflowproximal-policy-optimizationppo机器学习
Python 9.21 k
1 年前
thu-ml/tianshou
https://static.github-zh.com/github_avatars/thu-ml?size=40
thu-ml / tianshou

An elegant PyTorch deep reinforcement learning library.

PyTorchpolicy-gradientdqndouble-dqna2cddpgppotd3sacimitation-learningmujocoatarirlcql
Python 8.57 k
11 天前
https://static.github-zh.com/github_avatars/sweetice?size=40
sweetice / Deep-reinforcement-learning-with-pytorch

#算法刷题#PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

policy-gradientPyTorchactor-critic-algorithmalphagodeep-reinforcement-learninga2cdqnsarsappoa3cresnet算法深度学习reinforceactor-criticsactd3
Python 4.34 k
2 年前
https://static.github-zh.com/github_avatars/rlcode?size=40
rlcode / reinforcement-learning

#计算机科学#Minimal and Clean Reinforcement Learning Examples

reinforcement-learning深度学习deep-reinforcement-learning机器学习policy-gradientdeep-q-networkdqnactor-critica3c
Python 3.54 k
2 年前
https://static.github-zh.com/github_avatars/nikhilbarhate99?size=40
nikhilbarhate99 / PPO-PyTorch

#计算机科学#Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

pytorch-implmentionPyTorchpytorch-tutorialproximal-policy-optimizationreinforcement-learning-algorithmsdeep-reinforcement-learningppopolicy-gradient深度学习reinforcement-learning
Python 2.07 k
1 年前
kengz/SLM-Lab
https://static.github-zh.com/github_avatars/kengz?size=40
kengz / SLM-Lab

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

PyTorchreinforcement-learningdeep-reinforcement-learningbenchmarkpolicy-gradientdqnpposaca2ca3c
Python 1.28 k
4 个月前
https://static.github-zh.com/github_avatars/Khrylx?size=40
Khrylx / PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

reinforcement-learningpolicy-gradientpytorch-rlproximal-policy-optimizationppoPyTorcha2cGenerative Adversarial Networkdeep-reinforcement-learning
Python 1.22 k
4 年前
https://static.github-zh.com/github_avatars/Kismuz?size=40
Kismuz / btgym

#时序数据库#Scalable, event-driven, deep-learning-friendly backtesting library

reinforcement-learningdeep-reinforcement-learninggym-environmentopenai-gymbacktesting-trading-strategiesalgorithmic-trading-librarytime-seriesa3cTensorflowunrealadvantage-actor-criticpolicy-gradientstatistical-arbitrageHacktoberfest
Python 1 k
4 年前
https://static.github-zh.com/github_avatars/sudharsan13296?size=40
sudharsan13296 / Hands-On-Reinforcement-Learning-With-Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

reinforcement-learningdeep-reinforcement-learningsarsaq-learningdeep-q-network深度学习deep-deterministic-policy-gradientdouble-dqndueling-dqnppomarkov-decision-processespolicy-gradientopenai-gym
Jupyter Notebook 855
5 年前
https://static.github-zh.com/github_avatars/yaserkl?size=40
yaserkl / RLSeq2Seq

#自然语言处理#Deep Reinforcement Learning For Sequence to Sequence Models

reinforcement-learningactor-criticpolicy-gradient自然语言处理
Python 765
2 年前
https://static.github-zh.com/github_avatars/omerbsezer?size=40
omerbsezer / Reinforcement_learning_tutorial_with_demo

#计算机科学#Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers,...

reinforcement-learning教程机器学习q-learningsarsapolicy-gradientdeep-reinforcement-learningimitation-learningmeta-learningactor-criticpomdpsdynamic-programminga3c
Jupyter Notebook 757
6 年前
https://static.github-zh.com/github_avatars/benedekrozemberczki?size=40
benedekrozemberczki / awesome-monte-carlo-tree-search-papers

#学习与技能提升#A curated list of Monte Carlo tree search papers with implementations.

tree-searchlearning机器学习深度学习monte-carlo-tree-searchreinforcement-learningatarirldeep-q-learningq-learningreinforcement-learning-algorithmspolicy-gradient
Python 675
1 年前
https://static.github-zh.com/github_avatars/suragnair?size=40
suragnair / seqGAN

#自然语言处理#A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)

深度学习Generative Adversarial Networkseqganpolicy-gradient自然语言处理generative-modelnatural-language-understanding
Python 646
7 年前
https://static.github-zh.com/github_avatars/germain-hug?size=40
germain-hug / Deep-RL-Keras

Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)

a3creinforcement-learningKerasgymopenaipolicy-gradientddpgdqna2c
Python 544
5 年前
https://static.github-zh.com/github_avatars/theamrzaki?size=40
theamrzaki / text_summurization_abstractive_methods

#自然语言处理#Multiple implementations for abstractive text summurization , using google colab

Tensorflow深度学习自然语言处理机器学习seq2seqrnn人工智能encoder-decodergoogle-colabword2vecreinforcement-learningpolicy-gradient
Jupyter Notebook 527
5 年前
https://static.github-zh.com/github_avatars/medipixel?size=40
medipixel / rl_algorithms

#计算机科学#Structural implementation of RL key algorithms

reinforcement-learningPyTorchgympolicy-gradient深度学习dqnPython
Python 513
2 年前
https://static.github-zh.com/github_avatars/yukezhu?size=40
yukezhu / tensorflow-reinforce

Implementations of Reinforcement Learning Models in Tensorflow

reinforcement-learningTensorflowpolicy-gradientdeep-reinforcement-learningdeep-q-networkactor-critic
Python 487
8 年前
https://static.github-zh.com/github_avatars/VinF?size=40
VinF / deer

DEEp Reinforcement learning framework

deep-reinforcement-learningq-learningpolicy-gradient
Python 485
1 年前
https://static.github-zh.com/github_avatars/navneet-nmk?size=40
navneet-nmk / pytorch-rl

#计算机科学#This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch

dqnreinforcement-learning深度学习deep-reinforcement-learningPyTorchgymddpgopenaiopenai-gymRoboticsmujocopolicy-gradientGenerative Adversarial Networkvaevariational-autoencoder
Python 446
6 年前
loading...