GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

exploration-exploitation

Website
Wikipedia
https://static.github-zh.com/github_avatars/opendilab?size=40
opendilab / DI-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

reinforcement-learningmultiagent-reinforcement-learningself-playimitation-learninginverse-reinforcement-learningexploration-exploitationdistributed-systemPythonimpalasmacatarimujocor2d2reinforcement-learning-algorithmspytorch-rlmodel-based-reinforcement-learning
Python 3.45 k
9 天前
https://static.github-zh.com/github_avatars/wzhe06?size=40
wzhe06 / Reco-papers

#计算机科学#Classic papers and resources on recommendation

recommender-system深度学习机器学习recommendationexploration-exploitationreinforcement-learning
Python 3.37 k
5 年前
tigerneil/awesome-deep-rl
https://static.github-zh.com/github_avatars/tigerneil?size=40
tigerneil / awesome-deep-rl

For deep RL and the future of AI.

deep-reinforcement-learningreinforcement-learninggameartificial-general-intelligenceexploration-exploitationmultiagent-reinforcement-learningplanningicmlaaaiagiiclr
HTML 1.46 k
1 年前
https://static.github-zh.com/github_avatars/imsheridan?size=40
imsheridan / DeepRec

#计算机科学#推荐、广告工业界经典以及最前沿的论文、资料集合/ Must-read Papers on Recommendation System and CTR Prediction

深度学习recommendation-systemrecommendationreinforcement-learningexploration-exploitation
1.01 k
1 年前
https://static.github-zh.com/github_avatars/david-cortes?size=40
david-cortes / contextualbandits

Python implementations of contextual bandits algorithms

contextual-banditsreinforcement-learningexploration-exploitation
Python 788
1 个月前
https://static.github-zh.com/github_avatars/opendilab?size=40
opendilab / awesome-exploration-rl

#Awesome#A curated list of awesome exploration RL resources (continually updated)

exploration-exploitationreinforcement-learningAwesome Listsexplorationexploratoryreinforcement-learning-algorithms
490
4 个月前
https://static.github-zh.com/github_avatars/YaoYao1995?size=40
YaoYao1995 / MEEE

Code to reproduce the experiments in Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation (MEEE).

reinforcement-learningexploration-exploitationmodel-based-reinforcement-learning
Python 462
2 年前
https://static.github-zh.com/github_avatars/TianhongDai?size=40
TianhongDai / self-imitation-learning-pytorch

This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.

reinforcement-learning-algorithmsexploration-exploitationa2catari-games
Python 66
7 年前
https://static.github-zh.com/github_avatars/holarissun?size=40
holarissun / RewardShifting

Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL

ensembleexploration-exploitationoffline-reinforcement-learningreinforcement-learningdeep-q-networkensemble-learning
Python 30
2 年前
https://static.github-zh.com/github_avatars/stratisMarkou?size=40
stratisMarkou / sample-efficient-bayesian-rl

Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL

reinforcement-learningbayesian-methodsbayesian-inferenceq-learningexploration-exploitationexplorationreproducible-research
Jupyter Notebook 25
3 年前
https://static.github-zh.com/github_avatars/hmishfaq?size=40
hmishfaq / LSAC

The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025

exploration-exploitationpolicy-gradientreinforcement-learningsoft-actor-critic
Python 10
18 天前
https://static.github-zh.com/github_avatars/gokceuludogan?size=40
gokceuludogan / interactive-music-recommendation

Personalized and Interactive Music Recommendation with Bandit approach

exploration-exploitation
Jupyter Notebook 10
6 年前
https://static.github-zh.com/github_avatars/Amshra267?size=40
Amshra267 / Thompson-Greedy-Comparison-for-MultiArmed-Bandits

Repository Containing Comparison of two methods for dealing with Exploration-Exploitation dilemma for MultiArmed Bandits

exploration-exploitation
Python 10
4 年前
https://static.github-zh.com/github_avatars/mbhenaff?size=40
mbhenaff / neural-e3

#计算机科学#

deep-reinforcement-learningexploration-exploitation深度学习
Python 7
5 年前
https://static.github-zh.com/github_avatars/kakaobrain?size=40
kakaobrain / leco

Official implementation of LECO (NeurIPS'22)

exploration-exploitationreinforcement-learning
Python 7
2 年前
https://static.github-zh.com/github_avatars/hmishfaq?size=40
hmishfaq / LMC-LSVI

The official code release for Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo, ICLR 2024.

exploration-exploitationreinforcement-learning
Python 7
1 年前
https://static.github-zh.com/github_avatars/kkm24132?size=40
kkm24132 / ReinforcementLearning

Focuses on Reinforcement Learning related concepts, use cases, and learning approaches

reinforcement-learningexploration-exploitationsarsaq-learningpolicy-gradient
Jupyter Notebook 7
1 个月前
https://static.github-zh.com/github_avatars/baturaysaglam?size=40
baturaysaglam / DISCOVER

Deep Intrinsically Motivated Exploration in Continuous Control

actor-criticdeep-reinforcement-learningexploration-exploitation
Python 5
1 年前
https://static.github-zh.com/github_avatars/guptav96?size=40
guptav96 / bandit-algorithms

A short implementation of bandit algorithms - ETC, UCB, MOSS and KL-UCB

reinforcement-learningexploration-exploitation
Python 5
3 年前
https://static.github-zh.com/github_avatars/panxulab?size=40
panxulab / LSVI-ASE

The official code release for "More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling", Reinforcement Learning Conference (RLC) 2024

exploration-exploitationreinforcement-learning
Python 4
1 年前
loading...