GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

policy-learning

Website
Wikipedia
https://static.github-zh.com/github_avatars/OpenDriveLab?size=40
OpenDriveLab / End-to-end-Autonomous-Driving

[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving

end-to-end-autonomous-drivingautonomous-drivingpolicy-learningSimulation
3.13 k
6 个月前
https://static.github-zh.com/github_avatars/OpenDriveLab?size=40
OpenDriveLab / DriveAGI

[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving

foundation-modelautonomous-drivingembodied-aipolicy-learningvideo-generationworld-models
Python 730
5 个月前
https://static.github-zh.com/github_avatars/zubair-irshad?size=40
zubair-irshad / Awesome-Robotics-3D

#大语言模型#A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites

3Dbenchmarks机器视觉gaussian-splatting大语言模型manipulationnerfpolicy-learningpretrainingRoboticsscene-graphSimulationvision-language-modelvlmdiffusion-modelsfoundation-modelsnavigation
710
7 个月前
https://static.github-zh.com/github_avatars/DataCanvasIO?size=40
DataCanvasIO / YLearn

YLearn, a pun of "learn why", is a python package for causal inference

causal-inferencecausalitycausal-modelscausal-discoveryuplift-modelingpolicy-learning
Python 423
1 个月前
https://static.github-zh.com/github_avatars/OpenDriveLab?size=40
OpenDriveLab / PPGeo

[ICLR 2023] Pytorch implementation of PPGeo, a fully self-supervised driving policy pre-training framework to learn from unlabeled driving videos.

end-to-end-autonomous-drivingpolicy-learningself-supervised-learning
Python 127
2 年前
https://static.github-zh.com/github_avatars/OpenDriveLab?size=40
OpenDriveLab / MPI

[RSS 2024] Learning Manipulation by Predicting Interaction

policy-learningpre-trainingrobot-manipulation
Python 108
10 个月前
https://static.github-zh.com/github_avatars/metadriverse?size=40
metadriverse / ACO

[ECCV 2022] Learning to Drive by Watching YouTube Videos: Action-Conditioned Contrastive Policy Pretraining

policy-learningpretraining机器视觉
Python 84
2 年前
https://static.github-zh.com/github_avatars/grf-labs?size=40
grf-labs / policytree

Policy learning via doubly robust empirical welfare maximization over trees

causal-inferencepolicy-learning
R 81
1 年前
https://static.github-zh.com/github_avatars/mrana6?size=40
mrana6 / euclideanizing_flows

Stable dynamical system (motion policy) learning using Euclideanizing flows

imitation-learningpolicy-learningdynamical-systems
Python 13
5 年前
https://static.github-zh.com/github_avatars/robot-learning-freiburg?size=40
robot-learning-freiburg / TAPAS

PyTorch code for TAPAS-GMM.

人工智能imitation-learningpolicy-learningPyTorchRobotics
Jupyter Notebook 11
7 个月前
https://static.github-zh.com/github_avatars/CausalML?size=40
CausalML / doubly-robust-dropel

#计算机科学#Off-Policy Evaluation and Learning that is both Doubly Robust and Distributionally Robust.

机器学习policy-learningrobustness
Jupyter Notebook 9
3 年前
https://static.github-zh.com/github_avatars/mhr?size=40
mhr / kcpo-icml

Experiment code for "Koopman Constrained Policy Optimization: a Koopman operator theoretic method for differentiable optimal control in robotics" as presented at ICML 2023

mpcoptimal-controlpolicy-learningrobot-learning
Jupyter Notebook 8
2 年前
https://static.github-zh.com/github_avatars/max-eth?size=40
max-eth / racer

Black-box, gradient-free optimization of car-racing policies.

gympolicy-learningoptimization
Python 3
5 年前
https://static.github-zh.com/github_avatars/xiaobaobaochifan?size=40
xiaobaobaochifan / NAC

#计算机科学#The official repository for Net Actor-Critic

decision-making机器学习optimal-transportpolicy-learningreinforcement-learningoffline-reinforcement-learning
Python 2
3 个月前
https://static.github-zh.com/github_avatars/suraj5424?size=40
suraj5424 / Q-Learning-for-Blackjack-in-different-environments

#计算机科学#This repository implements Q-Learning in Blackjack, comparing it with random action selection and basic strategies. Includes experiments with various strategies, rule variations, and deck numbers to e...

agent-based-modeling人工智能blackjack机器学习policy-learningq-learningreinforcement-learningsarsa
Jupyter Notebook 0
3 个月前
https://static.github-zh.com/github_avatars/aditKadepurkar?size=40
aditKadepurkar / basic_diffusion_policy

Implementation of a basic diffusion policy in jax with a full pipeline of data collection -> data augmentation -> training -> inference/evaluation

diffusionpolicy-learningRobotics
Python 0
5 个月前