GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

muzero

Website
Wikipedia
https://static.github-zh.com/github_avatars/werner-duvaud?size=40
werner-duvaud / muzero-general

#计算机科学#MuZero

muzeroreinforcement-learningalphazeroPyTorchPythonself-learningmonte-carlo-tree-search深度学习deep-reinforcement-learning神经网络rltensorboardgymmctsalphago机器学习
Python 2.66 k
9 个月前
https://static.github-zh.com/github_avatars/opendilab?size=40
opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

alphazeroataricontinuous-controlmonte-carlo-tree-searchmuzeroPyTorchreinforcement-learningmctsboard-gamegymself-play
Python 1.39 k
4 天前
https://static.github-zh.com/github_avatars/huawei-noah?size=40
huawei-noah / xingtian

xingtian is a componentized library for the development and verification of reinforcement learning algorithms

impaladqnppomuzeroreinforcement-learning-algorithms
Python 311
2 年前
https://static.github-zh.com/github_avatars/johan-gras?size=40
johan-gras / MuZero

A structured implementation of MuZero

muzeroworld-modelsreinforcement-learningTensorflow
Python 204
3 年前
https://static.github-zh.com/github_avatars/kaesve?size=40
kaesve / muzero

#计算机科学#A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

muzeroalphazeroreinforcement-learningTensorflowtensorflow2mctstf2深度学习deep-reinforcement-learning
Jupyter Notebook 158
4 年前
https://static.github-zh.com/github_avatars/yenw?size=40
yenw / computer-go-dataset

datasets for computer go

Goalphagoalphazeromuzero
C++ 153
1 年前
https://static.github-zh.com/github_avatars/Zeta36?size=40
Zeta36 / muzero

A simple implementation of MuZero algorithm for connect4 game

muzeroPythonPyTorchdeepmindJupyter Notebook
Jupyter Notebook 97
5 年前
https://static.github-zh.com/github_avatars/rlglab?size=40
rlglab / minizero

MiniZero: An AlphaZero and MuZero Training Framework

alphazerodeep-reinforcement-learningmctsmuzeromonte-carlo-tree-searchatariGohexreinforcement-learning
C++ 93
4 个月前
https://static.github-zh.com/github_avatars/DHDev0?size=40
DHDev0 / Stochastic-muzero

#计算机科学#Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variation...

机器学习offline-reinforcement-learningdeep-reinforcement-learninggym-environmentslstmmonte-carlo-tree-searchmuzeroPyTorchrltransformermultilayer-perceptron
Python 66
2 年前
https://static.github-zh.com/github_avatars/Hwhitetooth?size=40
Hwhitetooth / jax_muzero

#计算机科学#An implementation of MuZero in JAX.

reinforcement-learning深度学习deep-reinforcement-learningmodel-based-reinforcement-learningmuzerojax
Python 56
3 年前
https://static.github-zh.com/github_avatars/hr0nix?size=40
hr0nix / omega

A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.

jaxmodel-based-reinforcement-learningmuzeroreinforcement-learningflaxmcts
Python 41
3 年前
https://static.github-zh.com/github_avatars/tuero?size=40
tuero / muzero-cpp

#计算机科学#A C++ pytorch implementation of MuZero

C++PyTorch机器学习reinforcement-learningmctsalphazeromuzerolibtorch
C++ 38
1 年前
https://static.github-zh.com/github_avatars/michaelnny?size=40
michaelnny / muzero

A PyTorch implementation of DeepMind's MuZero agent

alphazeromuzeroPyTorchreinforcement-learning
Python 33
2 年前
https://static.github-zh.com/github_avatars/sail-sg?size=40
sail-sg / rosmo

Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023

atarimuzerooffline-reinforcement-learningreinforcement-learningjaxmodel-based-reinforcement-learning
Python 29
2 年前
https://static.github-zh.com/github_avatars/DHDev0?size=40
DHDev0 / Muzero-unplugged

#计算机科学#Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations...

深度学习deep-reinforcement-learninggymlstm机器学习神经网络PythonPyTorchreinforcement-learningtransformerarxivgym-environmentsmonte-carlo-tree-searchmuzerorl
Python 27
2 年前
https://static.github-zh.com/github_avatars/bellerb?size=40
bellerb / chappie.ai

Generalized AI to perform a multitude of tasks written in python3

机器学习人工智能muzeromctschess-aiPyTorchattention-mechanismtransformerPython
Jupyter Notebook 21
2 年前
https://static.github-zh.com/github_avatars/rlglab?size=40
rlglab / optionzero

[ICLR 2025 Oral] OptionZero: A method for autonomously discovering and utilizing options in the MuZero algorithm

atarimctsmuzeroreinforcement-learningdeep-reinforcement-learningmonte-carlo-tree-search
C++ 17
1 个月前
https://static.github-zh.com/github_avatars/DHDev0?size=40
DHDev0 / Muzero

#计算机科学#Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and observation space.

arxiv深度学习deep-reinforcement-learning机器学习monte-carlo-tree-searchmuzero神经网络PythonPyTorchreinforcement-learningrlgymgym-environmentslstmtransformer
Python 17
2 年前
https://static.github-zh.com/github_avatars/jianzhnie?size=40
jianzhnie / RLZero

A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.

alpha-zeromctsmuzeroreinforcement-learningself-playmulti-agent
Python 16
8 个月前
https://static.github-zh.com/github_avatars/Itomigna2?size=40
Itomigna2 / Muesli-lunarlander

#计算机科学#Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)

colabreinforcement-learning深度学习muzero
Jupyter Notebook 16
1 年前
loading...