GitHub Chinese Community

#offline-reinforcement-learning

tinkoff-ai / CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

d4rl, gym, offline-reinforcement-learning, reinforcement-learning
Python 1.22k
2 years ago

ikostrikov / jaxrl

#Computer Science# JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

deep-learning, deep-reinforcement-learning, continuous-control, reinforcement-learning, soft-actor-critic, sac, deep-deterministic-policy-gradient, jax, flax, gym, offline-reinforcement-learning
Jupyter Notebook 685
3 years ago

yihaosun1124 / OfflineRL-Kit

#Computer Science# An elegant PyTorch offline reinforcement learning library for researchers.

deep-learning, deep-reinforcement-learning, PyTorch, reinforcement-learning, offline-reinforcement-learning
Python 342
1 year ago

Allenpandas / Reinforcement-Learning-Papers

📚 List of top-tier conference papers on Reinforcement Learning (RL), including NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.

deep-reinforcement-learning, reinforcement-learning, dqn, imitation-learning, multi-agent-reinforcement-learning, policy-gradient, q-learning, artificial-intelligence, aaai, icml, neurips, offline-reinforcement-learning
325
1 year ago

Cryolite / kanachan

#Computer Science# A Japanese (Riichi) Mahjong AI Framework

mahjong, riichi-mahjong, majsoul, machine-learning, game-ai, reinforcement-learning, deep-learning, deep-reinforcement-learning, transformers, transformer, imitation-learning, offline-reinforcement-learning, dqn
Python 313
4 months ago

nikhilbarhate99 / min-decision-transformer

#Computer Science# Minimal PyTorch implementation of "Decision Transformer: Reinforcement Learning via Sequence Modeling" for MuJoCo control tasks in OpenAI Gym

reinforcement-learning, deep-reinforcement-learning, deep-learning, offline-reinforcement-learning, PyTorch, pytorch-transformers, transformer, machine-learning, openai-gym, mujoco, Robotics
Python 274
3 years ago

polixir / OfflineRL

A collection of offline reinforcement learning algorithms.

offline-reinforcement-learning, reinforcement-learning
Python 188
7 months ago

instadeepai / og-marl

Datasets with baselines for offline multi-agent reinforcement learning.

multi-agent-reinforcement-learning, reinforcement-learning, offline-reinforcement-learning
Python 170
1 month ago

nissymori / JAX-CORL

Clean single-file implementation of offline RL algorithms in JAX

jax, single-file, flax, cql, reinforcement-learning, d4rl, offline-reinforcement-learning
Python 145
6 months ago

BY571 / CQL

#Computer Science# PyTorch implementation of the offline reinforcement learning algorithm CQL. Includes the DQN-CQL and SAC-CQL variants for discrete and continuous action spaces, respectively.

reinforcement-learning-algorithms, offline-reinforcement-learning, dqn, sac, pytorch-implementation, PyTorch, machine-learning
Python 137
1 year ago

polixir / NeoRL

Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets

offline-reinforcement-learning
Python 123
7 months ago

ZhengyaoJiang / latentplan

Code release for "Efficient Planning in a Compact Latent Action Space" (ICLR 2023): https://arxiv.org/abs/2208.10291

model-based-reinforcement-learning, offline-reinforcement-learning, reinforcement-learning, generative-model, transformer
Python 107
2 years ago

ZhengYinan-AIR / FISOR

[ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"

diffusion-models, jax, offline-reinforcement-learning, imitation-learning, reinforcement-learning
Python 101
4 months ago

silverwingsbot / EasyCarla-RL

A simple and easy-to-use autonomous driving environment for reinforcement learning, based on the CARLA simulator.

autonomous-driving, autonomous-vehicles, carla, carla-simulator, gym, offline-reinforcement-learning, reinforcement-learning, rl, self-driving, decision-making
Python 95
1 month ago

EmptyJackson / unifloral

Unified Implementations of Offline Reinforcement Learning Algorithms

d4rl, jax, offline-reinforcement-learning, flax, wandb
Python 80
2 months ago

snu-mllab / EDAC

Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)

offline-reinforcement-learning
Python 75
3 years ago

DHDev0 / Stochastic-muzero

#Computer Science# PyTorch implementation of Stochastic MuZero for Gym environments. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variation...

machine-learning, offline-reinforcement-learning, deep-reinforcement-learning, gym-environments, lstm, monte-carlo-tree-search, muzero, PyTorch, rl, transformer, multilayer-perceptron
Python 66
2 years ago

ryanxhr / POR

[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"

offline-reinforcement-learning, PyTorch
Python 57
2 years ago

tinkoff-ai / ReBRAC

Authors' implementation of ReBRAC, a minimalist improvement upon TD3+BC

offline-reinforcement-learning, reinforcement-learning
Jupyter Notebook 55
2 years ago

tinkoff-ai / sac-rnd

Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023

offline-reinforcement-learning, deep-reinforcement-learning
Python 53
2 years ago