#gui-agent

bytedance / UI-TARS-desktop

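Agent TARS is a general-purpose multimodal AI agent stack that brings the power of GUI agents and vision to your terminal, computer, browser, and products. UI-TARS Desktop is a desktop application that provides a native GUI agent based on the UI-TARS model.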

agent, vlm, vision, computer-use, mcp, mcp-server, gui-operator, browser-use, gui-agent, multimodal, tars, ui-tars, agent-tars
TypeScript 18.69 k
31 minutes ago

showlab / ShowUI

[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

computer-use, vision-language-model, agent, gui-agent
Python 1.47 k
4 months ago

trycua / acu

A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.

artificial intelligence, Awesome Lists, computer, computer-use, gui-agent
1.41 k
4 months ago

zai-org / CogAgent

An open-sourced end-to-end VLM-based GUI Agent

gui-agent, computer-use, vlm, agent, glm
Python 1.05 k
5 months ago

OS-Agent-Survey / OS-Agent-Survey

This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).

survey, agent, large language models, GUI, gui-agent, computer-use, phone-use, browser-agent, web-agent, operator
348
1 month ago

SunzeY / SEAgent

Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"

agent, computer-use-agent, gui-agent, grpo, rl, vllm
Python 187
1 month ago

ritzz-ai / GUI-R1

Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents

deep-reinforcement-learning, gui-agent, large-multimodal-models, multimodal, multimodal-large-language-models, grpo, o1
Python 179
4 months ago

lll6gg / UI-R1

Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"

gui-agent, multimodal-large-language-models, multimodal-learning, reinforcement-learning
Python 126
4 months ago

InfiXAI / InfiGUI-G1

#Computer Science# Official repository for InfiGUI-G1. We introduce Adaptive Exploration Policy Optimization (AEPO) to overcome semantic alignment bottlenecks in GUI agents through efficient, guided exploration.

machine vision, deep learning, gui-agent, large-language-models, reinforcement-learning
Python 107
11 days ago

iMeanAI / open-source-operator

Create your self-hosted, open-source Operator model.

browseruse, gui-agent
Python 100
5 months ago

showlab / WorldGUI

Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.

agents, gui-agent, gui-application, large-multimodal-models
Python 94
2 months ago

wendell0218 / GVA-Survey

#Large Language Models# Official repository of the paper "Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms"

embodied-agent, gui-agent, gva, large language models, mllm, multi-agent-system, survey, vlm
82
2 months ago

open-compass / MMBench-GUI

Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent with a hierarchical manner across multiple platforms, including ...

benchmark-framework, computer-use, gui-agent, vision-language-model
Python 74
6 days ago

TurixAI / TuriX-CUA

This is the official website for TuriX Computer-use-Agent

agent, ai-agents, computer-use-agent, cua, computer-automation, mcp, computer-use, browser-use, gui-agent, gui-operator
Python 65
10 days ago

TongUI-agent / TongUI-agent

Release of code, datasets and model for our work TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials

computer-use-agent, vision-language-model, agent, computer-use, gui-agent
HTML 47
2 months ago

ahnjaewoo / FlashAdventure

🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"

gui-agent, agent, computer-use, embodied-agent, vision-language-model, vlm
Python 11
12 days ago

V-Droid-Agent / V-Droid

#Large Language Models# Source code of the paper "V-Droid: Advancing Mobile GUI Agent Through Generative Verifiers"

agent, computer-use, gui-agent, large language models, phone-use, mobile, mobile-agents
Python 5
2 months ago

Magic-Abracadabra / AI-Chinese-Scripting-Language

This is a quick test of Chinese Scripting Language powered by AI. You can use it to open any text file. No illegal use is allowed! Free for commercial use and academic use.

context-engineering, gui-agent, prompt-engineering
Python 2
1 month ago