#

advantage-actor-critic

https://static.github-zh.com/github_avatars/vwxyzjn?size=40
Python 7.86 k
2 个月前
https://static.github-zh.com/github_avatars/ikostrikov?size=40

#计算机科学#PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) ...

Python 3.83 k
3 年前
https://static.github-zh.com/github_avatars/qfettes?size=40
Jupyter Notebook 1.08 k
4 年前
https://static.github-zh.com/github_avatars/PacktPublishing?size=40

Code for Hands On Intelligent Agents with OpenAI Gym book to get started and learn to build deep reinforcement learning agents using PyTorch

Python 387
3 年前
https://static.github-zh.com/github_avatars/inoryy?size=40
Jupyter Notebook 206
4 年前
https://static.github-zh.com/github_avatars/lcswillems?size=40
Python 205
3 年前
https://static.github-zh.com/github_avatars/CherryPieSexy?size=40
Python 146
4 年前
https://static.github-zh.com/github_avatars/med-air?size=40

[ICRA 2023] Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot

Python 43
2 年前
https://static.github-zh.com/github_avatars/popovicidaniela?size=40
TeX 37
8 年前
https://static.github-zh.com/github_avatars/dionhaefner?size=40

The friendly robot that beats you in Yahtzee 🤖 🎲

Python 22
9 个月前
https://static.github-zh.com/github_avatars/monoelh?size=40

MLP-framework (pure numpy) and DDQN-framework for OpenAI's Gym games. +test code for PPO added. +Hindsight Experience Replay(HER) bitflip-DQN example. +prioritized replay.

Jupyter Notebook 19
7 年前
https://static.github-zh.com/github_avatars/Po-Hsun-Su?size=40

Deep reinforcement learning package for torch7

Lua 16
9 年前
loading...
Website
Wikipedia