GitHub Chinese Community

#gui-agents

bytedance / UI-TARS-desktop

A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.

agent, vlm, Electron, vision, Vite, computer-use, gui-agents, mcp, mcp-server
TypeScript 14.62 k
21 hours ago

simular-ai / Agent-S

Agent S: an open agentic framework that uses computers like a human

agent-computer-interface, ai-agents, computer-automation, gui-agents, memory, mllm, planning, retrieval-augmented-generation, in-context-reinforcement-learning, computer-use, grounding
Python 5.44 k
6 days ago

showlab / Awesome-GUI-Agent

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

ai-assistant, Awesome Lists, gui-agents, GUI, llm-agent
722
15 days ago

OSU-NLP-Group / UGround

[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents

Artificial Intelligence, gui-agents
Python 242
19 days ago

eric-ai-lab / Screen-Point-and-Read

Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"

ai-agents, grounding, screen-reader, gui-agents
Python 28
1 year ago

philfung / awesome-computer-use

#Large Language Models# Curated resources about automated GUI computer-use via LLMs. Highly opinionated, focus is on quality vs quantity.

computer-use, Computer Vision, Large Language Models, tool-use, vision, rpa, anthropic, anthropic-claude, gpt-4-vision, gui-agents
22
7 months ago

runamu / monday

[CVPR 2025] Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents

agent, computer-use, Computer Vision, gui-agents, vision-language-model, vlm
Python 16
13 days ago

alaa-nadi / UI-TARS-desktop

A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.

agent, browser-use, computer-use, Electron, gui-agents, mcp, mcp-server, vision, Vite, vlm
TypeScript 1
8 days ago

elesxx / Agent-S

Agent S: an open agentic framework that uses computers like a human

agent, agent-based-model, aiagents, computer-automation, deepseek-r1, developer-tools, function-calling, grounding, gui-agents, mllm, ssh, swarm, TypeScript
Python 0
1 month ago