# soft-actor-critic

https://static.github-zh.com/github_avatars/rail-berkeley?size=40

#Computer Science# Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Python 1.34 k
2 years ago
https://static.github-zh.com/github_avatars/quantumiracle?size=40

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Jupyter Notebook 1.28 k
6 months ago
https://static.github-zh.com/github_avatars/Rafael1s?size=40

32 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.

Jupyter Notebook 940
4 years ago
https://static.github-zh.com/github_avatars/TianhongDai?size=40

#Algorithm Practice# This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are st...

Python 683
5 years ago
https://static.github-zh.com/github_avatars/trackmania-rl?size=40

Reinforcement Learning for real-time applications - host of the TrackMania Roborace League

Python 626
23 days ago
https://static.github-zh.com/github_avatars/zhaohaojie1998?size=40

Deep reinforcement learning path planning, SAC-Auto path planning, Soft Actor-Critic algorithm, SAC-pytorch, LiDAR obstacle avoidance, LiDAR simulation, Adaptive-SAC

Python 430
1 year ago
https://static.github-zh.com/github_avatars/RITCHIEHuang?size=40

DeepRL algorithm implementations that are easy to read and understand, in PyTorch and TensorFlow 2 (DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

Python 341
2 years ago
https://static.github-zh.com/github_avatars/BY571?size=40

PyTorch implementation of Soft Actor-Critic with Prioritized Experience Replay (PER), Emphasizing Recent Experience (ERE), Munchausen RL, D2RL, and parallel environments.

Python 291
5 years ago
https://static.github-zh.com/github_avatars/sungyubkim?size=40
Jupyter Notebook 217
2 years ago
https://static.github-zh.com/github_avatars/evgenii-nikishin?size=40

#Computer Science# JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"

Python 100
3 years ago
https://static.github-zh.com/github_avatars/cyoon1729?size=40

Implementations of algorithms from the policy gradient family. Currently includes: A2C, A3C, DDPG, TD3, SAC

Jupyter Notebook 100
6 years ago