[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use".
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use".
Code for "UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning"
Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.
Official implementation of "GUI-R1: A Generalist R1-Style Vision-Language Action Model For GUI Agents"
#Large Language Models# Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms