GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

gui-agent

Website
Wikipedia
https://static.github-zh.com/github_avatars/bytedance?size=40
bytedance / UI-TARS-desktop

The Open-sourced Multimodal AI Agent Stack connecting Cutting-edge AI Models and Agent Infra.

agentvlmvisioncomputer-usemcpmcp-servergui-operatorbrowser-usegui-agentmultimodaltarsui-tarsagent-tars
TypeScript 15.4 k
1 天前
https://static.github-zh.com/github_avatars/showlab?size=40
showlab / ShowUI

[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

computer-usevision-language-modelagentgui-agent
Python 1.4 k
2 个月前
https://static.github-zh.com/github_avatars/trycua?size=40
trycua / acu

A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.

人工智能Awesome Listscomputercomputer-usegui-agent
1.36 k
3 个月前
https://static.github-zh.com/github_avatars/trycua?size=40
trycua / acu

A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.

人工智能Awesome Listscomputercomputer-usegui-agent
1.36 k
3 个月前
https://static.github-zh.com/github_avatars/zai-org?size=40
zai-org / CogAgent

An open-sourced end-to-end VLM-based GUI Agent

gui-agentcomputer-usevlmagentglm
Python 1.01 k
4 个月前
https://static.github-zh.com/github_avatars/OS-Agent-Survey?size=40
OS-Agent-Survey / OS-Agent-Survey

This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).

surveyagent大语言模型GUIgui-agentcomputer-usephone-usebrowser-agentweb-agentoperator
312
1 个月前
https://static.github-zh.com/github_avatars/OS-Agent-Survey?size=40
OS-Agent-Survey / OS-Agent-Survey

This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).

surveyagent大语言模型GUIgui-agentcomputer-usephone-usebrowser-agentweb-agentoperator
312
1 个月前
https://static.github-zh.com/github_avatars/ritzz-ai?size=40
ritzz-ai / GUI-R1

Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents

deep-reinforcement-learninggui-agentlarge-multimodal-modelsmultimodalmultimodal-large-language-modelsgrpoo1
Python 152
3 个月前
https://static.github-zh.com/github_avatars/lll6gg?size=40
lll6gg / UI-R1

Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"

gui-agentmultimodal-large-language-modelsmultimodal-learningreinforcement-learning
Python 120
2 个月前
https://static.github-zh.com/github_avatars/iMeanAI?size=40
iMeanAI / open-source-operator

Create your self-hosted, open-source Operator model.

browserusegui-agent
Python 100
4 个月前
https://static.github-zh.com/github_avatars/showlab?size=40
showlab / WorldGUI

Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.

agentsgui-agentgui-applicationlarge-multimodal-models
Python 87
4 天前
https://static.github-zh.com/github_avatars/wendell0218?size=40
wendell0218 / GVA-Survey

#大语言模型#Official repository of the paper "Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms"

embodied-agentgui-agentgva大语言模型mllmmulti-agent-systemsurveyvlm
78
20 天前
https://static.github-zh.com/github_avatars/TongUI-agent?size=40
TongUI-agent / TongUI-agent

Release of code, datasets and model for our work TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials

vision-language-modelagentcomputer-usegui-agent
HTML 42
20 天前
https://static.github-zh.com/github_avatars/open-compass?size=40
open-compass / MMBench-GUI

Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent with a hierarchical manner across multiple platforms, including ...

benchmark-frameworkcomputer-usegui-agentvision-language-model
Python 40
7 天前
https://static.github-zh.com/github_avatars/V-Droid-Agent?size=40
V-Droid-Agent / V-Droid

#大语言模型#Source code of the paper "V-Droid: Advancing Mobile GUI Agent Through Generative Verifiers"

agentcomputer-usegui-agent大语言模型phone-use移动mobile-agents
Python 4
23 天前
https://static.github-zh.com/github_avatars/Yah185?size=40
Yah185 / open-source-operator

Create your self-hosted, open-source Operator model.

browserusegui-agent
0
13 天前
https://static.github-zh.com/github_avatars/jamal22552?size=40
jamal22552 / AI-Infra

#大语言模型#Explore the AI-Infra repository for a structured learning path and a visual landscape of modern AI infrastructure in Kubernetes and cloud-native ecosystems. 🌐💻

agent-tars人工智能browser-usecloud-computingcomputer-use深度学习distributed-cloudgenaigui-agentKubernetes大语言模型机器学习mlsysopenaitarsui-tarsvlm
0
17 天前