rl

LlamaFamily / Llama-Chinese

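#Large language models# Llama Chinese community: a real-time roundup of the latest Llama learning materials, building the best open-source ecosystem for Chinese Llama large models; fully open source and commercially usable.

llama, large language models, pretraining, agent, llama4, rl
Python, 14.65k stars
Updated 4 months ago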
google / dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

rl, machine learning, artificial intelligence, Google, TensorFlow
Jupyter Notebook, 10.78k stars
Updated 9 months ago
thu-ml / tianshou

An elegant PyTorch deep reinforcement learning library.

PyTorch, policy-gradient, dqn, double-dqn, a2c, ddpg, ppo, td3, sac, imitation-learning, mujoco, atari, rl, cql
Python, 8.67k stars
Updated 14 days ago
OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, Kimi, and more!

large language models, lora, reinforcement-learning, agent, agentic-ai, grpo, rl, kimi-ai, qwen, qwen3
Python, 3.99k stars
Updated 2 hours ago
junxiaosong / AlphaZero_Gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

alphazero, mcts, alphago-zero, gobang, monte-carlo-tree-search, alphago, reinforcement-learning, rl, board-game, self-learning, PyTorch, TensorFlow
Python, 3.5k stars
Updated 1 year ago
pytorch / ELF

ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation

reinforcement-learning, alphago-zero, rl, rl-environment, alpha-zero, Go
C++, 3.4k stars
Updated 6 years ago
pytorch / rl

#Computer science# A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

artificial intelligence, control, decision-making, distributed-computing, machine learning, marl, model-based-reinforcement-learning, multi-agent-reinforcement-learning, PyTorch, reinforcement-learning, rl, Robotics, torch
Python, 2.97k stars
Updated 1 day ago
werner-duvaud / muzero-general

#Computer science# MuZero

muzero, reinforcement-learning, alphazero, PyTorch, Python, self-learning, monte-carlo-tree-search, deep learning, deep-reinforcement-learning, neural networks, rl, tensorboard, gym, mcts, alphago, machine learning
Python, 2.68k stars
Updated 1 year ago
DLR-RM / rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

rl, reinforcement-learning, stable-baselines, openai, gym, pybullet, hyperparameter-optimization, hyperparameter-tuning, hyperparameter-search, optimization, sde, Robotics, lab, deep-reinforcement-learning, PyTorch
Python, 2.5k stars
Updated 6 days ago
IntelLabs / coach

#Computer science# Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state-of-the-art reinforcement learning algorithms.

coach, openai-gym, reinforcement-learning, TensorFlow, rl, carla, imitation-learning, mujoco, roboschool, deep learning, starcraft, starcraft2, mxnet, onnx
Python, 2.35k stars
Updated 3 years ago
inclusionAI / AReaL

#Large language models# Distributed RL System for LLM Reasoning

large language models, machine-learning-systems, mlsys, reinforcement-learning, rl
Python, 2.1k stars
Updated 1 day ago
PRIME-RL / PRIME

#Large language models# Scalable RL solution for advanced reasoning of language models

large language models, reasoning, rl
Python, 1.67k stars
Updated 4 months ago
MaximeVandegar / Papers-in-100-Lines-of-Code

#Computer science# Implementation of papers in 100 lines of code.

Python, research, deep learning, machine learning, educational, PyTorch, papers, generative-model, nerf, artificial intelligence, gans, aes, 3D, meta-learning, neural-radiance-fields, reinforcement-learning, rl, diffusion-models
Python, 1.59k stars
Updated 18 days ago
pathak22 / noreward-rl

#Computer science# [ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning

deep-reinforcement-learning, exploration, deep learning, rl, deep neural networks, mario, doom, self-supervised, TensorFlow, openai-gym
Python, 1.45k stars
Updated 3 years ago
zzli2022 / Awesome-System2-Reasoning-LLM

Latest Advances on System-2 Reasoning

benchmark, mcts, o1, prm, reasoning, rl, o3
Python, 1.21k stars
Updated 2 months ago
araffin / rl-baselines-zoo

A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.

rl, reinforcement-learning, stable-baselines, openai-gym, openai, gym, pybullet, optimization, hyperparameter-optimization, hyperparameter-search, hyperparameter-tuning
Python, 1.18k stars
Updated 3 years ago
sail-sg / understand-r1-zero

#Large language models# Understanding R1-Zero-Like Training: A Critical Perspective

large language models, reasoning, rl
Python, 1.05k stars
Updated 7 days ago
FareedKhan-dev / all-rl-algorithms

#Large language models# Implementation of all RL algorithms in a simpler way

agent, large language models, openai, Python, reinforcement-learning, rl
Jupyter Notebook, 1.01k stars
Updated 3 months ago
MushroomRL / mushroom-rl

#Computer science# Python library for Reinforcement Learning.

reinforcement-learning, deep-reinforcement-learning, deep learning, openai-gym, atari, rl, PyTorch, mujoco, dqn, ddpg, pybullet, sac
Python, 893 stars
Updated 21 days ago
google-research / rliable

#Computer science# [NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

reinforcement-learning, benchmarking, evaluation-metrics, machine learning, Google, rl
Jupyter Notebook, 837 stars
Updated 1 year ago