GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

self-play

Website
Wikipedia
https://static.github-zh.com/github_avatars/suragnair?size=40
suragnair / alpha-zero-general

#计算机科学#A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

TensorflowPyTorchKerasgobangalpha-zeroalphago-zeroalphagoreinforcement-learningself-playmctsmonte-carlo-tree-search深度学习alphazero神经网络
Jupyter Notebook 4.2 k
7 个月前
https://static.github-zh.com/github_avatars/opendilab?size=40
opendilab / DI-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

reinforcement-learningmultiagent-reinforcement-learningself-playimitation-learninginverse-reinforcement-learningexploration-exploitationdistributed-systemPythonimpalasmacatarimujocor2d2reinforcement-learning-algorithmspytorch-rlmodel-based-reinforcement-learning
Python 3.5 k
3 天前
https://static.github-zh.com/github_avatars/opendilab?size=40
opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

alphazeroataricontinuous-controlmonte-carlo-tree-searchmuzeroPyTorchreinforcement-learningmctsboard-gamegymself-play
Python 1.42 k
2 天前
https://static.github-zh.com/github_avatars/opendilab?size=40
opendilab / DI-star

#计算机科学#An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.

reinforcment-learningstarcraft2self-play人工智能深度学习leaguedeep-reinforcement-learning
Python 1.29 k
5 个月前
https://static.github-zh.com/github_avatars/uclaml?size=40
uclaml / SPIN

#计算机科学#The official implementation of Self-Play Fine-Tuning (SPIN)

深度学习fine-tuninglarge-language-modelsself-play
Python 1.18 k
1 年前
https://static.github-zh.com/github_avatars/uclaml?size=40
uclaml / SPPO

#计算机科学#The official implementation of Self-Play Preference Optimization (SPPO)

深度学习fine-tuninglarge-language-modelsrlhfself-play
Python 570
6 个月前
https://static.github-zh.com/github_avatars/inspirai?size=40
inspirai / TimeChamber

A Massively Parallel Large Scale Self-Play Framework

deep-reinforcement-learningreinforcement-learningself-playmulti-agent
Python 351
3 年前
https://static.github-zh.com/github_avatars/ChuaCheowHuan?size=40
ChuaCheowHuan / gym-continuousDoubleAuction

A custom MARL (multi-agent reinforcement learning) environment where multiple agents trade against one another (self-play) in a zero-sum continuous double auction. Ray [RLlib] is used for training.

multi-agent-reinforcement-learninggym-environmentlimit-order-bookhigh-frequency-tradingrayrllibfinancial-engineeringself-playppoquantitative-financequantitative-tradingmarllstm
Jupyter Notebook 148
5 天前
https://static.github-zh.com/github_avatars/spiral-rl?size=40
spiral-rl / spiral

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

large-language-modelsself-playmulti-agent-reinforcement-learningreinforcement-learning
Python 124
6 天前
https://static.github-zh.com/github_avatars/Naton1?size=40
Naton1 / osrs-pvp-reinforcement-learning

#计算机科学#Train a neural network to PvP in Old School RuneScape using reinforcement learning.

人工智能深度学习gymJava机器学习oldschool-runescapeosrsppoPythonPyTorchreinforcement-learningrspsrunescapeself-play
Java 119
1 年前
https://static.github-zh.com/github_avatars/blanyal?size=40
blanyal / alpha-zero

#计算机科学#AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement ...

alphazeroalpha-zeroalphago-zeroTensorflowreinforcement-learningmctsself-playgame深度学习机器学习resnettic-tac-toedeepmind
Python 90
7 年前
https://static.github-zh.com/github_avatars/seungeunrho?size=40
seungeunrho / football-paris

The exact codes used by the team "liveinparis" at the kaggle football competition ranked 6th/1141

self-playreinforcement-learningPyTorchppokaggle
Python 57
5 年前
https://static.github-zh.com/github_avatars/cestpasphoto?size=40
cestpasphoto / alpha-zero-general

A very fast implementation of AlphaZero, applied to games like Splendor, Santorini, The Little Prince, … Browser version available

alphagoalphago-zeroalphazeroPythonPyTorchreinforcement-learningnumbaself-play
Python 51
5 天前
https://static.github-zh.com/github_avatars/dellalibera?size=40
dellalibera / gym-backgammon

Backgammon OpenAI Gym

gymreinforcement-learningself-playgameopenai-gym人工智能
Python 48
1 年前
https://static.github-zh.com/github_avatars/dellalibera?size=40
dellalibera / td-gammon

TD-Gammon implementation

人工智能reinforcement-learning神经网络PyTorchconvolutional-neural-networksself-playgame
Python 48
2 年前
https://static.github-zh.com/github_avatars/Sebastian-Schuchmann?size=40
Sebastian-Schuchmann / Self-Play-TicTacToe-AI-ML-Agents-

#计算机科学#A Self Play reinforcement learning Agent learns to play TicTacToe using the ML-Agents Framework in Unity.

人工智能机器学习reinforcement-learningml-agentsUnityself-play神经网络Tensorflow
C# 37
3 年前
https://static.github-zh.com/github_avatars/tobiasemrich?size=40
tobiasemrich / SchafkopfRL

AI agents for the bavarian card game Schafkopf trained with reinforcement learning

collectible-card-gamepporeinforcement-learningself-playPyTorch
Python 37
15 天前
https://static.github-zh.com/github_avatars/ShibiHe?size=40
ShibiHe / Model-Free-Episodic-Control

This is the implementation of paper Model Free Episodic Control

openai-gymdeepknnNumPyself-playgame-theory
Python 36
6 年前
https://static.github-zh.com/github_avatars/sirmammingtonham?size=40
sirmammingtonham / alphastone

#计算机科学#Using self-play, MCTS, and a deep neural network to create a hearthstone ai player

alpha-zeroself-playmonte-carlo-tree-search深度学习deep-reinforcement-learningPyTorchhearthstone人工智能
Python 29
7 年前
https://static.github-zh.com/github_avatars/arianahejazyan?size=40
arianahejazyan / Athena

#计算机科学#A UCI-compatible four-player chess engine powered by deep RL and 256-bit bitboards.

人工智能chesschess-engine深度学习neural-networksC++游戏开发reinforcement-learningchess-aideep-rluciself-play
C++ 28
17 天前
loading...