GitHub Chinese Community

gui-agent

trycua / acu

A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.

artificial-intelligence, awesome-lists, computer, computer-use, gui-agent
1.3k
1 month ago
showlab / ShowUI

[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

computer-use, vision-language-model, agent, gui-agent
Python 1.3k
18 days ago
THUDM / CogAgent

An open-sourced end-to-end VLM-based GUI Agent

gui-agent, computer-use, vlm, agent, glm
Python 969
2 months ago
francedot / acu

A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.

artificial-intelligence, awesome-lists, computer, computer-use, gui-agent
379
5 months ago
OS-Agent-Survey / OS-Agent-Survey

This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025).

survey, agent, large-language-models, GUI, gui-agent, computer-use, phone-use, browser-agent, web-agent, operator
289
1 month ago
OS-Agent-Survey / OS-Agent-Survey

This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use".

survey, agent, large-language-models, GUI, gui-agent, computer-use, phone-use, browser-agent, web-agent
165
5 months ago
ritzz-ai / GUI-R1

Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents

deep-reinforcement-learning, gui-agent, large-multimodal-models, multimodal, multimodal-large-language-models, grpo, o1
Python 110
1 month ago
lll6gg / UI-R1

Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"

gui-agent, multimodal-large-language-models, multimodal-learning, reinforcement-learning
Python 107
20 days ago
iMeanAI / open-source-operator

Create your self-hosted, open-source Operator model.

browseruse, gui-agent
Python 96
2 months ago
showlab / GUI-Thinker

Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.

agents, gui-agent, gui-application, large-multimodal-models
Python 70
2 months ago
TongUI-agent / TongUI-agent

Code, datasets, and model release for "TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials"

vision-language-model, agent, computer-use, gui-agent
HTML 21
18 days ago
wendell0218 / GVA-Survey

Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms

embodied-agent, gui-agent, gva, large-language-models, mllm, multi-agent-system, survey, vlm
18
3 months ago
Yah185 / open-source-operator

Create your self-hosted, open-source Operator model.

browseruse, gui-agent
0
5 days ago