embodied-agent

#Awesome#Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

Awesome Lists embodied-agent embodied-ai foundation-model foundation-models generative-agents generative-ai generative-model generative-models 大语言模型 large-language-models ChatGPT gpt-4

2.12 k

5 个月前

zchoi / Awesome-Embodied-Robotics-and-Agent

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

embodied-agent embodied-ai scene-understanding navigation planning-algorithms Awesome Lists agent 大语言模型

1.52 k

2 个月前

TheShadow29 / awesome-grounding

#自然语言处理#awesome grounding: A curated list of research papers in visual grounding

机器视觉自然语言处理 grounding Awesome Lists papers arxiv video-understanding captioning-videos embodied-agent multimodal-deep-learning language-grounding Bukkit

1.11 k

2 个月前

tmgthb / Autonomous-Agents

#大语言模型#Autonomous Agents (LLMs) research papers. Updated Daily.

autonomous-agents agents 人工智能 ai-agents embodied-agent 大语言模型 llm-agents research-paper agent agentic aiagent aiagents

1.01 k

3 天前

eric-ai-lab / awesome-vision-language-navigation

A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"

vision-and-language navigation embodied-agent

539

1 年前

kyegomez / RT-2

Democratization of RT-2 "RT-2: New model translates vision and language into action"

人工智能 attention-mechanism embodied-agent gpt4 multi-modal Robotics transformer

Python 509

1 年前

haoranD / Awesome-Embodied-AI

#学习与技能提升#A curated list of awesome papers on Embodied AI and related research/industry-driven resources.

Awesome Lists classification detection embodied-agent embodied-ai learning navigation perception recognition segmentation visual vlm

467

3 个月前

RobotecAI / rai

#大语言模型#RAI is a vendor-agnostic agentic framework for robotics, utilizing ROS 2 tools to perform complex actions, defined scenarios, free interface execution, log summaries, voice interaction and more.

ai-agents-framework embodied-agent embodied-ai generative-ai Robotics ros2 大语言模型 multimodal vlm 人工智能

Python 374

3 天前

allenai / allenact

#计算机科学#An open source framework for research in Embodied-AI from AI2.

reinforcement-learning embodied-agent 人工智能 research Python 深度学习 ai2 机器视觉

Python 369

24 天前

zju-vipa / Odyssey

#大语言模型#Odyssey: Empowering Minecraft Agents with Open-World Skills

agent large-language-models 大语言模型我的世界 embodied-agent llm-agent fine-tuning

Python 336

3 个月前

mbodiai / embodied-agents

#大语言模型#Seamlessly integrate state-of-the-art transformer models into robotics stacks

large-language-models 大语言模型 Robotics transformer vision-language-model vlm 人工智能 diffusion generative-ai agents multimodal embodied-agent

Python 238

3 个月前

Yuxing-Wang-THU / SurveyBrainBody

Brain-Body Co-Design for Embodied Agents: Taxonomy, Frontiers, and Challenges

embodied-agent Robotics survey agent evolution

193

2 天前

Gary3410 / TaPA

[arXiv 2023] Embodied Task Planning with Large Language Models

embodied-agent llama Robotics

Python 190

2 年前

iris0329 / SeeGround

[CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding

vlm zero-shot 3d-scene-understanding embodied-agent embodied-ai

Python 171

5 个月前

hanxunyu / Inst3D-LMM

[CVPR 2025 Highlight🔥] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning"

embodied-agent scene-understanding multi-task-learning

Python 112

1 个月前

AoqunJin / Awesome-VLA-Post-Training

A collection of vision-language-action model post-training methods.

embodied-agent embodied-ai fine-tuning post-training

18 天前

Zhoues / MineDreamer

[IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control "

diffusion-model embodied-agent 我的世界

Python 95

3 个月前

bigai-nlco / langsuite

Official Repo of LangSuitE

embodied-agent large-language-models autonomous-agents

Python 84

1 年前

wendell0218 / GVA-Survey

#大语言模型#Official repository of the paper "Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms"

embodied-agent gui-agent gva 大语言模型 mllm multi-agent-system survey vlm

2 个月前

mazpie / genrl

[NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning them into sequences of latent world model states. Latent state se...

embodied-agent embodied-ai foundation-models multimodal reinforcement-learning world-models

Python 80

5 个月前

Website
Wikipedia