GitHub Chinese Community
#sarsa-lambda

MorvanZhou / Reinforcement-learning-with-tensorflow

#Computer Science# Simple reinforcement learning tutorials; 莫烦Python Chinese-language AI tutorials

reinforcement-learning, tutorial, q-learning, sarsa, sarsa-lambda, deep-q-network, a3c, ddpg, policy-gradient, dqn, double-dqn, dueling-dqn, deep-deterministic-policy-gradient, actor-critic, Tensorflow, proximal-policy-optimization, ppo, machine-learning
Python 9.21k
1 year ago

chen0040 / java-reinforcement-learning

Package provides Java implementations of reinforcement learning algorithms such as Q-Learn, R-Learn, SARSA, and Actor-Critic

reinforcement-learning, q-learning, sarsa, sarsa-lambda, actor-critic, Java
Java 128
6 years ago

DavidMouse1118 / Reinforcement-Learning-Maze-World

SARSA, Q-Learning, Expected SARSA, SARSA(λ) and Double Q-learning Implementation and Analysis

reinforcement-learning, q-learning, sarsa-lambda
Python 30
6 years ago

NeymarL / Pacman-RL

Implement some reinforcement learning algorithms, test and visualize on Pacman.

reinforcement-learning, pacman, q-learning, sarsa-lambda, policy, policy-gradient, actor-critic
Python 27
7 years ago

moripiri / Reinforcement-Learning-on-FrozenLake

Reinforcement Learning Algorithms in FrozenLake-v1

reinforcement-learning, openai-gym, reinforcement-learning-algorithms, actor-critic, q-learning, reinforce, sarsa, sarsa-lambda
Jupyter Notebook 24
4 months ago

leofansq / Reinforcement_Learning_Curling

A curling game example based on reinforcement learning (RL); gradient-descent Sarsa(lambda) + non-uniform radial basis feature representation

reinforcement-learning, sarsa-lambda, Demo
Python 21
5 years ago

leolellisr / poke_RL

Code repository with classical reinforcement learning and deep reinforcement learning methods for Pokémon battles in Pokémon Showdown.

pokemon, reinforcement-learning, sarsa-lambda, function-approximation, game, dqn, double-dqn, ppo, ppo2, deep-reinforcement-learning, deep-rl, reinforce, pokemon-showdown
Jupyter Notebook 13
7 months ago

RicardoDominguez / RL-Intro

Introduction to Reinforcement Learning in Python

reinforcement-learning, reinforcement-learning-algorithms, sarsa, sarsa-lambda, q-learning, actor-critic, tree-search
Python 13
7 years ago

timbmg / easy21-rl

Easy21 assignment from David Silver's RL Course at UCL

reinforcement-learning, reinforcement-learning-excercises, function-approximation, rl, sarsa, sarsa-lambda
Python 12
7 years ago

imraviagrawal / Reinforcement-Learning-Implementation

Implementation of Reinforcement Algorithms from scratch

reinforcement-learning, blackbox-optimization, sarsa, q-learning, sarsa-lambda, actor-critic-algorithm, reinforce
Python 9
7 years ago

wikke / Reinforcement_Learning

A series of reinforcement learning algorithms: Q-Learning, Sarsa, SarsaLambda, and Deep Q-Learning (DQN), used to play OpenAI Gym games

dqn, deep-q-learning, deep-q-network, reinforcement-learning, openai-gym, q-learning, sarsa, sarsa-lambda
Python 9
8 years ago

Pegah-Ardehkhani / Shortest-Path-using-Reinforcement-Learning

Solve the shortest path problem using Reinforcement Learning. This project applies RL techniques, such as Q-learning and SARSA(λ), to find optimal routes in a weighted graph, where the algorithm learn...

q-learning, reinforcement-learning, reinforcement-learning-algorithms, sarsa, sarsa-lambda
Jupyter Notebook 5
6 months ago

nilskruse / mdp

Markov decision process master thesis

markov-decision-processes, q-learning, reinforcement-learning, reinforcement-learning-algorithms, reinforcement-learning-environments, Rust, sarsa, sarsa-lambda
Rust 3
2 years ago

weiyx16 / RL_Toy_Example

Deep RL toy example based on gym package with several methods

gym, sarsa, sarsa-lambda, a2c, a3c
Python 3
5 years ago

klima7 / Reversi-RL

Reversi game with multiple reinforcement learning algorithms.

mcts, q-learning, reinforcement-learning, sarsa, sarsa-lambda
Python 2
3 years ago

PranavNyati / Reinforcement_Learning_Term_Project

This repository contains the code for term projects in the Reinforcement Learning course (CS600077) that I am taking in the Autumn 2023 semester at IIT Kharagpur

markov-decision-processes, q-learning, reinforcement-learning, sarsa-lambda
Python 2
1 year ago

arielfayol37 / Easy21

Applying reinforcement learning methods to a simple card game.

collectible-card-game, sarsa-lambda
Python 1
10 months ago

Hysperr / gridworld-v2

An agent learns the optimal path towards its goal from any starting point while avoiding obstacles.

artificial-intelligence, C#, markov-decision-processes, pathfinding, q-learning, sarsa-lambda
C# 1
1 year ago

pagrim / FrozenLake

Q-learning agent to solve the frozen lake problem from the OpenAI gym

q-learning, sarsa, sarsa-lambda, reinforcement-learning
Python 1
3 years ago

valinsogna / ReinForceMate

Advanced RL algorithms for two simplified versions of chess. Shortest Path finds the minimal moves between two cells based on piece capabilities. Capture Pieces trains against random opponents aiming ...

deep-q-learning, q-learning, reinforcement-learning, sarsa, sarsa-lambda
Jupyter Notebook 1
2 years ago