The source code for the blog post "The 37 Implementation Details of Proximal Policy Optimization"
2022-01-14
No
2024-03-23T04:47:28Z
The source code for the gym-microrts paper.
[WIP] Vectorized architecture for value-based methods such as DQN and DDPG
#Computer Science# PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
An educational resource to help anyone learn deep reinforcement learning.
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
#Computer Science# Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Open-Sora: a fully open-source, efficient reproduction of Sora-like video generation
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
#Computer Science# Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
#Computer Science# A curated list of reinforcement learning with human feedback resources (continually updated)
#Large Language Models# Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, and SAC.
eBPF distributed networking observability tool for Kubernetes