GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

grounding

Website
Wikipedia
https://static.github-zh.com/github_avatars/simular-ai?size=40
simular-ai / Agent-S

Agent S: an open agentic framework that uses computers like a human

agent-computer-interfaceai-agentscomputer-automationgui-agentsmemorymllmplanningretrieval-augmented-generationin-context-reinforcement-learningcomputer-usegrounding
Python 5.44 k
6 天前
https://static.github-zh.com/github_avatars/BAAI-Agents?size=40
BAAI-Agents / Cradle

#大语言模型#The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, ...

ai-agentai-agents-frameworkcomputer-controlcradlegccgenerative-aigroundinglarge-language-models大语言模型lmmmultimodalityvision-language-modelvlm人工智能
Python 2.11 k
7 个月前
https://static.github-zh.com/github_avatars/TheShadow29?size=40
TheShadow29 / awesome-grounding

#自然语言处理#awesome grounding: A curated list of research papers in visual grounding

机器视觉自然语言处理groundingAwesome Listspapersarxivvideo-understandingcaptioning-videosembodied-agentmultimodal-deep-learninglanguage-groundingBukkit
1.08 k
2 年前
https://static.github-zh.com/github_avatars/mees?size=40
mees / calvin

#自然语言处理#CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

自然语言处理Robotics深度学习groundingvision-languagemanipulation机器视觉PyTorchvisionvision-and-language
Python 585
4 个月前
https://static.github-zh.com/github_avatars/FoundationVision?size=40
FoundationVision / Groma

#大语言模型#[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

grounding大语言模型mllmlarge-language-modelsfoundation-modelsllamallama2multimodalvision-language-model
Python 567
1 年前
https://static.github-zh.com/github_avatars/cliport?size=40
cliport / cliport

#自然语言处理#CLIPort: What and Where Pathways for Robotic Manipulation

clipRoboticsvision深度学习自然语言处理groundingvision-languagemanipulationPyTorchrearrangement机器视觉
Jupyter Notebook 498
2 年前
https://static.github-zh.com/github_avatars/allenai?size=40
allenai / lumos

Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"

decision-makinggroundingmathsplanningquestion-answeringreasoningweb-agent
Python 464
1 年前
https://static.github-zh.com/github_avatars/flowersteam?size=40
flowersteam / Grounding_LLMs_with_online_RL

We perform functional grounding of LLMs' knowledge in BabyAI-Text

groundinglanguage-modelreinforcement-learning
Python 261
10 个月前
https://static.github-zh.com/github_avatars/mbzuai-oryx?size=40
mbzuai-oryx / Video-LLaVA

#大语言模型#PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models

大语言模型lmmVideogroundingtranscription
Python 256
1 年前
https://static.github-zh.com/github_avatars/linhuixiao?size=40
linhuixiao / Awesome-Visual-Grounding

[TPAMI reviewing] Towards Visual Grounding: A Survey

groundingAwesome Listssurvey
Shell 166
3 个月前
https://static.github-zh.com/github_avatars/linhuixiao?size=40
linhuixiao / CLIP-VG

[TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.

groundingclip
Jupyter Notebook 122
5 个月前
https://static.github-zh.com/github_avatars/TIGER-AI-Lab?size=40
TIGER-AI-Lab / StructLM

#大语言模型#Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)

grounding大语言模型reasoning
Python 76
8 个月前
https://static.github-zh.com/github_avatars/lukashermann?size=40
lukashermann / hulc

#自然语言处理#Hierarchical Universal Language Conditioned Policies

机器视觉深度学习groundingmanipulation自然语言处理PyTorchRoboticsvisionvision-and-languagevision-language
Python 73
1 年前
https://static.github-zh.com/github_avatars/TheShadow29?size=40
TheShadow29 / zsgnet-pytorch

#自然语言处理#Official implementation of ICCV19 oral paper Zero-Shot grounding of Objects from Natural Language Queries (https://arxiv.org/abs/1908.07129)

groundingvision自然语言处理objects
Python 71
5 年前
https://static.github-zh.com/github_avatars/TheShadow29?size=40
TheShadow29 / vognet-pytorch

#自然语言处理#[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)

groundingVideopytorch-implementationvisionvision-and-language自然语言处理captioning-videos
Python 67
5 年前
https://static.github-zh.com/github_avatars/TheShadow29?size=40
TheShadow29 / VidSitu

#自然语言处理#[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)

visionvision-and-languagegrounding自然语言处理Videosrlcaptioning-videoscaptioning
Python 60
4 年前
https://static.github-zh.com/github_avatars/zjukg?size=40
zjukg / DUET

[Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning

pretrained-language-modelPyTorchtransformerzero-shot-learningcross-modalgroundingsemantic
Python 52
1 年前
https://static.github-zh.com/github_avatars/linhuixiao?size=40
linhuixiao / HiVG

[ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.

clipgrounding
Python 50
2 个月前
https://static.github-zh.com/github_avatars/mees?size=40
mees / hulc2

#自然语言处理#[ICRA2023] Grounding Language with Visual Affordances over Unstructured Data

机器视觉深度学习groundingmanipulation自然语言处理PyTorchRoboticsvisionvision-and-languagevision-language
Python 43
2 年前
https://static.github-zh.com/github_avatars/yuleiniu?size=40
yuleiniu / vc

Code for CVPR'18 "Grounding Referring Expressions in Images by Variational Context"

cvpr2018Tensorflowgrounding
Python 30
7 年前
loading...