GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

self-play

Website
Wikipedia
https://static.github-zh.com/github_avatars/suragnair?size=40
suragnair / alpha-zero-general

#计算机科学#A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

TensorflowPyTorchKerasgobangalpha-zeroalphago-zeroalphagoreinforcement-learningself-playmctsmonte-carlo-tree-search深度学习alphazero神经网络
Jupyter Notebook 4.16 k
5 个月前
https://static.github-zh.com/github_avatars/opendilab?size=40
opendilab / DI-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

reinforcement-learningmultiagent-reinforcement-learningself-playimitation-learninginverse-reinforcement-learningexploration-exploitationdistributed-systemPythonimpalasmacatarimujocor2d2reinforcement-learning-algorithmspytorch-rlmodel-based-reinforcement-learning
Python 3.45 k
10 天前
https://static.github-zh.com/github_avatars/opendilab?size=40
opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

alphazeroataricontinuous-controlmonte-carlo-tree-searchmuzeroPyTorchreinforcement-learningmctsboard-gamegymself-play
Python 1.39 k
4 天前
https://static.github-zh.com/github_avatars/opendilab?size=40
opendilab / DI-star

#计算机科学#An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.

reinforcment-learningstarcraft2self-play人工智能深度学习leaguedeep-reinforcement-learning
Python 1.28 k
3 个月前
https://static.github-zh.com/github_avatars/uclaml?size=40
uclaml / SPIN

#计算机科学#The official implementation of Self-Play Fine-Tuning (SPIN)

深度学习fine-tuninglarge-language-modelsself-play
Python 1.16 k
1 年前
https://static.github-zh.com/github_avatars/uclaml?size=40
uclaml / SPPO

#计算机科学#The official implementation of Self-Play Preference Optimization (SPPO)

深度学习fine-tuninglarge-language-modelsrlhfself-play
Python 565
5 个月前
https://static.github-zh.com/github_avatars/inspirai?size=40
inspirai / TimeChamber

A Massively Parallel Large Scale Self-Play Framework

deep-reinforcement-learningreinforcement-learningself-playmulti-agent
Python 349
2 年前
https://static.github-zh.com/github_avatars/ChuaCheowHuan?size=40
ChuaCheowHuan / gym-continuousDoubleAuction

A custom MARL (multi-agent reinforcement learning) environment where multiple agents trade against one another (self-play) in a zero-sum continuous double auction. Ray [RLlib] is used for training.

multi-agent-reinforcement-learninggym-environmentlimit-order-bookhigh-frequency-tradingrayrllibfinancial-engineeringself-playppoquantitative-financequantitative-tradingmarllstm
Jupyter Notebook 146
2 年前
https://static.github-zh.com/github_avatars/Naton1?size=40
Naton1 / osrs-pvp-reinforcement-learning

#计算机科学#Train a neural network to PvP in Old School RuneScape using reinforcement learning.

人工智能深度学习gymJava机器学习oldschool-runescapeosrsppoPythonPyTorchreinforcement-learningrspsrunescapeself-play
Java 112
1 年前
https://static.github-zh.com/github_avatars/blanyal?size=40
blanyal / alpha-zero

#计算机科学#AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement ...

alphazeroalpha-zeroalphago-zeroTensorflowreinforcement-learningmctsself-playgame深度学习机器学习resnettic-tac-toedeepmind
Python 90
7 年前
https://static.github-zh.com/github_avatars/seungeunrho?size=40
seungeunrho / football-paris

The exact codes used by the team "liveinparis" at the kaggle football competition ranked 6th/1141

self-playreinforcement-learningPyTorchppokaggle
Python 57
5 年前
https://static.github-zh.com/github_avatars/cestpasphoto?size=40
cestpasphoto / alpha-zero-general

A very fast implementation of AlphaZero, applied to games like Splendor, Santorini, The Little Prince, … Browser version available

alphagoalphago-zeroalphazeroPythonPyTorchreinforcement-learningnumbaself-play
Python 49
5 个月前
https://static.github-zh.com/github_avatars/dellalibera?size=40
dellalibera / gym-backgammon

Backgammon OpenAI Gym

gymreinforcement-learningself-playgameopenai-gym人工智能
Python 45
1 年前
https://static.github-zh.com/github_avatars/dellalibera?size=40
dellalibera / td-gammon

TD-Gammon implementation

人工智能reinforcement-learning神经网络PyTorchconvolutional-neural-networksself-playgame
Python 45
2 年前
https://static.github-zh.com/github_avatars/Sebastian-Schuchmann?size=40
Sebastian-Schuchmann / Self-Play-TicTacToe-AI-ML-Agents-

#计算机科学#A Self Play reinforcement learning Agent learns to play TicTacToe using the ML-Agents Framework in Unity.

人工智能机器学习reinforcement-learningml-agentsUnityself-play神经网络Tensorflow
C# 37
2 年前
https://static.github-zh.com/github_avatars/tobiasemrich?size=40
tobiasemrich / SchafkopfRL

AI agents for the bavarian card game Schafkopf trained with reinforcement learning

collectible-card-gamepporeinforcement-learningself-playPyTorch
Python 37
1 年前
https://static.github-zh.com/github_avatars/ShibiHe?size=40
ShibiHe / Model-Free-Episodic-Control

This is the implementation of paper Model Free Episodic Control

openai-gymdeepknnNumPyself-playgame-theory
Python 36
6 年前
https://static.github-zh.com/github_avatars/sirmammingtonham?size=40
sirmammingtonham / alphastone

#计算机科学#Using self-play, MCTS, and a deep neural network to create a hearthstone ai player

alpha-zeroself-playmonte-carlo-tree-search深度学习deep-reinforcement-learningPyTorchhearthstone人工智能
Python 29
7 年前
https://static.github-zh.com/github_avatars/cmubig?size=40
cmubig / sorts

Code base for Social Robot Tree Search (SoRTS).

mctsself-play
Python 26
1 年前
https://static.github-zh.com/github_avatars/mbaske?size=40
mbaske / ml-selfplay-fighter

Self-Play Boxing Match made with Unity Machine Learning Agents

Unityml-agentsself-play
C# 22
4 年前
loading...