muzero · GitHub Topics

muzero reinforcement-learning alphazero PyTorch Python self-learning monte-carlo-tree-search deep-learning deep-reinforcement-learning neural-network rl tensorboard gym mcts alphago machine-learning

Python 2.68 k

1 year ago

opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

alphazero atari continuous-control monte-carlo-tree-search muzero PyTorch reinforcement-learning mcts board-game gym self-play

Python 1.42 k

2 days ago

huawei-noah / xingtian

xingtian is a componentized library for the development and verification of reinforcement learning algorithms

impala dqn ppo muzero reinforcement-learning-algorithms

Python 311

2 years ago

johan-gras / MuZero

A structured implementation of MuZero

muzero world-models reinforcement-learning Tensorflow

Python 205

3 years ago

kaesve / muzero

#Computer Science# A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and pit both algorithms against each other, and investigate the reliability of learned MuZero MDP models.

muzero alphazero reinforcement-learning Tensorflow tensorflow2 mcts tf2 deep-learning deep-reinforcement-learning

Jupyter Notebook 160

4 years ago

yenw / computer-go-dataset

datasets for computer go

Go alphago alphazero muzero

C++ 153

1 year ago

rlglab / minizero

MiniZero: An AlphaZero and MuZero Training Framework

alphazero deep-reinforcement-learning mcts muzero monte-carlo-tree-search atari Go hex reinforcement-learning

C++ 96

6 days ago

Zeta36 / muzero

A simple implementation of MuZero algorithm for connect4 game

muzero Python PyTorch deepmind Jupyter Notebook

Jupyter Notebook 96

5 years ago

DHDev0 / Stochastic-muzero

#Computer Science# PyTorch implementation of Stochastic MuZero for gym environments. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variation...

machine-learning offline-reinforcement-learning deep-reinforcement-learning gym-environments lstm monte-carlo-tree-search muzero PyTorch rl transformer multilayer-perceptron

Python 69

2 years ago

Hwhitetooth / jax_muzero

#Computer Science# An implementation of MuZero in JAX.

reinforcement-learning deep-learning deep-reinforcement-learning model-based-reinforcement-learning muzero jax

Python 56

3 years ago

hr0nix / omega

A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.

jax model-based-reinforcement-learning muzero reinforcement-learning flax mcts

Python 41

3 years ago

tuero / muzero-cpp

#Computer Science# A C++ PyTorch implementation of MuZero

C++ PyTorch machine-learning reinforcement-learning mcts alphazero muzero libtorch

C++ 39

1 year ago

michaelnny / muzero

A PyTorch implementation of DeepMind's MuZero agent

alphazero muzero PyTorch reinforcement-learning

Python 35

2 years ago

DHDev0 / Muzero-unplugged

#Computer Science# PyTorch implementation of MuZero Unplugged for gym environments. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations...

deep-learning deep-reinforcement-learning gym lstm machine-learning neural-network Python PyTorch reinforcement-learning transformer arxiv gym-environments monte-carlo-tree-search muzero rl

Python 29

1 month ago

sail-sg / rosmo

Code for "Efficient Offline Policy Optimization with a Learned Model", ICLR 2023

atari muzero offline-reinforcement-learning reinforcement-learning jax model-based-reinforcement-learning

Python 29

2 years ago

bellerb / chappie.ai

Generalized AI, written in Python 3, to perform a multitude of tasks

machine-learning artificial-intelligence muzero mcts chess-ai PyTorch attention-mechanism transformer Python

Jupyter Notebook 21

2 years ago

rlglab / optionzero

[ICLR 2025 Oral] OptionZero: A method for autonomously discovering and utilizing options in the MuZero algorithm

atari mcts muzero reinforcement-learning deep-reinforcement-learning monte-carlo-tree-search

C++ 19

2 months ago

DHDev0 / Muzero

#Computer Science# PyTorch implementation of MuZero for gym environments. It supports any Discrete, Box, and Box2D configuration for the action space and observation space.