GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

rl

Website
Wikipedia
https://static.github-zh.com/github_avatars/LlamaFamily?size=40
LlamaFamily / Llama-Chinese

#大语言模型#Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用

llama大语言模型pretrainingagentllama4rl
Python 14.61 k
2 个月前
https://static.github-zh.com/github_avatars/google?size=40
google / dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

rl机器学习人工智能GoogleTensorflow
Jupyter Notebook 10.75 k
7 个月前
thu-ml/tianshou
https://static.github-zh.com/github_avatars/thu-ml?size=40
thu-ml / tianshou

An elegant PyTorch deep reinforcement learning library.

PyTorchpolicy-gradientdqndouble-dqna2cddpgppotd3sacimitation-learningmujocoatarirlcql
Python 8.57 k
10 天前
https://static.github-zh.com/github_avatars/junxiaosong?size=40
junxiaosong / AlphaZero_Gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

alphazeromctsalphago-zerogobangmonte-carlo-tree-searchalphagoreinforcement-learningrlboard-gameself-learningPyTorchTensorflow
Python 3.49 k
1 年前
https://static.github-zh.com/github_avatars/pytorch?size=40
pytorch / ELF

ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation

reinforcement-learningalphago-zerorlrl-environmentalpha-zeroGo
C++ 3.4 k
6 年前
pytorch/rl
https://static.github-zh.com/github_avatars/pytorch?size=40
pytorch / rl

#计算机科学#A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

人工智能controldecision-makingdistributed-computing机器学习marlmodel-based-reinforcement-learningmulti-agent-reinforcement-learningPyTorchreinforcement-learningrlRoboticstorch
Python 2.83 k
2 天前
https://static.github-zh.com/github_avatars/werner-duvaud?size=40
werner-duvaud / muzero-general

#计算机科学#MuZero

muzeroreinforcement-learningalphazeroPyTorchPythonself-learningmonte-carlo-tree-search深度学习deep-reinforcement-learning神经网络rltensorboardgymmctsalphago机器学习
Python 2.66 k
9 个月前
DLR-RM/rl-baselines3-zoo
https://static.github-zh.com/github_avatars/DLR-RM?size=40
DLR-RM / rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

rlreinforcement-learningstable-baselinesopenaigympybullethyperparameter-optimizationhyperparameter-tuninghyperparameter-searchoptimizationsdeRoboticslabdeep-reinforcement-learningPyTorch
Python 2.46 k
13 天前
https://static.github-zh.com/github_avatars/IntelLabs?size=40
IntelLabs / coach

#计算机科学#Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms

coachopenai-gymreinforcement-learningTensorflowrlcarlaimitation-learningmujocoroboschool深度学习starcraftstarcraft2mxnetonnx
Python 2.35 k
3 年前
https://static.github-zh.com/github_avatars/inclusionAI?size=40
inclusionAI / AReaL

#大语言模型#Distributed RL System for LLM Reasoning

大语言模型machine-learning-systemsmlsysreinforcement-learningrl
Python 1.66 k
5 天前
https://static.github-zh.com/github_avatars/MaximeVandegar?size=40
MaximeVandegar / Papers-in-100-Lines-of-Code

#计算机科学#Implementation of papers in 100 lines of code.

Pythonresearch深度学习机器学习educationalPyTorchpapersgenerative-modelnerf人工智能gansaes3Dmeta-learningneural-radiance-fieldsreinforcement-learningrldiffusion-models
Python 1.56 k
1 个月前
https://static.github-zh.com/github_avatars/pathak22?size=40
pathak22 / noreward-rl

#计算机科学#[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning

deep-reinforcement-learningexploration深度学习rl深度神经网络mariodoomself-supervisedTensorflowopenai-gym
Python 1.44 k
3 年前
araffin/rl-baselines-zoo
https://static.github-zh.com/github_avatars/araffin?size=40
araffin / rl-baselines-zoo

A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.

rlreinforcement-learningstable-baselinesopenai-gymopenaigympybulletoptimizationhyperparameter-optimizationhyperparameter-searchhyperparameter-tuning
Python 1.18 k
3 年前
https://static.github-zh.com/github_avatars/zzli2022?size=40
zzli2022 / Awesome-System2-Reasoning-LLM

Latest Advances on System-2 Reasoning

benchmarkmctso1prmreasoningrlo3
Python 1.07 k
7 天前
https://static.github-zh.com/github_avatars/sail-sg?size=40
sail-sg / understand-r1-zero

#大语言模型#Understanding R1-Zero-Like Training: A Critical Perspective

大语言模型reasoningrl
Python 979
23 天前
https://static.github-zh.com/github_avatars/FareedKhan-dev?size=40
FareedKhan-dev / all-rl-algorithms

#大语言模型#Implementation of all RL algorithms in a simpler way

agent大语言模型openaiPythonreinforcement-learningrl
Jupyter Notebook 897
2 个月前
https://static.github-zh.com/github_avatars/MushroomRL?size=40
MushroomRL / mushroom-rl

#计算机科学#Python library for Reinforcement Learning.

reinforcement-learningdeep-reinforcement-learning深度学习openai-gymatarirlPyTorchmujocodqnddpgpybulletsac
Python 885
2 个月前
https://static.github-zh.com/github_avatars/google-research?size=40
google-research / rliable

#计算机科学#[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

reinforcement-learningbenchmarkingevaluation-metrics机器学习Googlerl
Jupyter Notebook 832
10 个月前
https://static.github-zh.com/github_avatars/google-research?size=40
google-research / seed_rl

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

rlimpalar2d2atarideepmind-labgoogle-research-footballtf2Google 云
Python 825
3 年前
https://static.github-zh.com/github_avatars/Toni-SM?size=40
Toni-SM / skrl

#计算机科学#Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Omniverse Isaac Gym and Isaac Lab

reinforcement-learningPythonopenai-gymPyTorch深度学习deepmindgymisaac-simrl机器学习Roboticsjax
Python 782
7 天前
loading...