GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

mcts

Website
Wikipedia
https://static.github-zh.com/github_avatars/hijkzzz?size=40
hijkzzz / Awesome-LLM-Strawberry

#大语言模型#A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

chain-of-thoughtCode大语言模型数学mctsopenai-o1strawberryreinforcement-learning
6.75 k
4 天前
https://static.github-zh.com/github_avatars/suragnair?size=40
suragnair / alpha-zero-general

#计算机科学#A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

TensorflowPyTorchKerasgobangalpha-zeroalphago-zeroalphagoreinforcement-learningself-playmctsmonte-carlo-tree-search深度学习alphazero神经网络
Jupyter Notebook 4.16 k
5 个月前
https://static.github-zh.com/github_avatars/junxiaosong?size=40
junxiaosong / AlphaZero_Gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

alphazeromctsalphago-zerogobangmonte-carlo-tree-searchalphagoreinforcement-learningrlboard-gameself-learningPyTorchTensorflow
Python 3.49 k
1 年前
https://static.github-zh.com/github_avatars/werner-duvaud?size=40
werner-duvaud / muzero-general

#计算机科学#MuZero

muzeroreinforcement-learningalphazeroPyTorchPythonself-learningmonte-carlo-tree-search深度学习deep-reinforcement-learning神经网络rltensorboardgymmctsalphago机器学习
Python 2.66 k
9 个月前
https://static.github-zh.com/github_avatars/opendilab?size=40
opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

alphazeroataricontinuous-controlmonte-carlo-tree-searchmuzeroPyTorchreinforcement-learningmctsboard-gamegymself-play
Python 1.39 k
4 天前
https://static.github-zh.com/github_avatars/zzli2022?size=40
zzli2022 / Awesome-System2-Reasoning-LLM

Latest Advances on System-2 Reasoning

benchmarkmctso1prmreasoningrlo3
Python 1.07 k
7 天前
https://static.github-zh.com/github_avatars/yaotingwangofficial?size=40
yaotingwangofficial / Awesome-MCoT

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

chain-of-thoughtcotdeepseek-r1instruction-tuninglarge-vision-language-modelmultimodalmultimodal-chain-of-thoughtmultimodal-large-language-modelsopenai-o1reasoningsurveymcts
642
1 个月前
https://static.github-zh.com/github_avatars/chauvinSimon?size=40
chauvinSimon / My_Bibliography_for_Research_on_Autonomous_Driving

Personal notes about scientific and research works on "Decision-Making for Autonomous Driving"

reinforcement-learninginverse-reinforcement-learningplanningmodel-based-reinforcement-learningdecision-makinggame-theorymctspredictionbibliographycarlaimitation-learningend-to-endinteractionrisk-assessment
459
5 年前
https://static.github-zh.com/github_avatars/s-casci?size=40
s-casci / tinyzero

Easily train AlphaZero-like agents on any environment you want!

alphazeromctsreinforcement-learning
Python 430
1 年前
https://static.github-zh.com/github_avatars/hrpan?size=40
hrpan / tetris_mcts

#计算机科学#MCTS project for Tetris

reinforcement-learningmctstetris深度学习gametetris-bots
Python 348
8 个月前
https://static.github-zh.com/github_avatars/dylandjian?size=40
dylandjian / SuperGo

#计算机科学#A student implementation of Alpha Go Zero

alphago-zeroalphagoreinforcement-learningPyTorchmctsPython机器学习
Python 280
7 年前
https://static.github-zh.com/github_avatars/QueensGambit?size=40
QueensGambit / CrazyAra

#计算机科学#A Deep Learning UCI-Chess Variant Engine written in C++ & Python 🦜

Pythonchess-engine深度学习人工智能convolutional-neural-networkmctsalphazeromxnetgluonOpen Source机器学习lichessalphago
Jupyter Notebook 265
3 个月前
https://static.github-zh.com/github_avatars/DataCanvasIO?size=40
DataCanvasIO / Hypernets

A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.

neural-architecture-searchhyperparameter-optimizationhyperparameter-tuningevolutionary-algorithmsmonte-carlo-tree-searchautomlautodlreinforcement-learningmctsnasKeras
Python 265
2 个月前
https://static.github-zh.com/github_avatars/vgarciasc?size=40
vgarciasc / mcts-viz

Visualization of MCTS algorithm applied to Tic-tac-toe.

mcts可视化p5js
JavaScript 247
4 年前
https://static.github-zh.com/github_avatars/sungyubkim?size=40
sungyubkim / Deep_RL_with_pytorch

A pytorch tutorial for DRL(Deep Reinforcement Learning)

deep-reinforcement-learningPyTorchdqna2cpposoft-actor-criticmcts
Jupyter Notebook 216
2 年前
https://static.github-zh.com/github_avatars/initial-h?size=40
initial-h / AlphaZero_Gomoku_MPI

#算法刷题#An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku

alphazeroparallelTensorflowalphagomctstensorlayertree-search算法deep-reinforcement-learning
Python 209
4 个月前
https://static.github-zh.com/github_avatars/thuxugang?size=40
thuxugang / doudizhu

AI斗地主

人工智能collectible-card-gamedqnreinforcement-learningdoudizhumcts
Python 184
7 年前
https://static.github-zh.com/github_avatars/kaesve?size=40
kaesve / muzero

#计算机科学#A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

muzeroalphazeroreinforcement-learningTensorflowtensorflow2mctstf2深度学习deep-reinforcement-learning
Jupyter Notebook 158
4 年前
https://static.github-zh.com/github_avatars/zjeffer?size=40
zjeffer / chess-deep-rl

#计算机科学#Research project: create a chess engine using Deep Reinforcement Learning

chessalphazeroreinforcement-learning人工智能神经网络mcts深度学习机器学习deep-reinforcement-learningneural-networkschess-engine
Jupyter Notebook 142
1 年前
https://static.github-zh.com/github_avatars/akolishchak?size=40
akolishchak / doom-net-pytorch

#学习与技能提升#Reinforcement learning models in ViZDoom environment

PyTorchvizdoomreinforcement-learningdoomagentlearningppomctsbehavior-tree
Python 131
3 年前
loading...