#

computer-use

https://static.github-zh.com/github_avatars/bytedance?size=40

Agent TARS 是一个通用的多模态 AI Agent Stack,它将 GUI Agent 和 Vision 的强大功能带入你的终端、计算机、浏览器和产品中。UI-TARS Desktop 是一个桌面应用程序,基于 UI-TARS 模型提供原生的 GUI Agent。

TypeScript 18.75 k
2 小时前
https://static.github-zh.com/github_avatars/web-infra-dev?size=40
TypeScript 10.29 k
3 天前
Upsonic/Upsonic
https://static.github-zh.com/github_avatars/Upsonic?size=40
Python 7.66 k
4 天前
https://static.github-zh.com/github_avatars/bytebot-ai?size=40

#大语言模型#Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containerized Linux desktop environment.

TypeScript 7.06 k
3 天前
https://static.github-zh.com/github_avatars/A9T9?size=40

Ui.Vision Open-Source RPA Software with Computer Vision, OCR, Anthropic Computer Use/LLM. Selenium IDE import/export.

JavaScript 1.71 k
5 个月前
https://static.github-zh.com/github_avatars/e2b-dev?size=40
Python 1.55 k
3 个月前
https://static.github-zh.com/github_avatars/showlab?size=40

[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

Python 1.47 k
4 个月前
https://static.github-zh.com/github_avatars/trycua?size=40

A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.

1.41 k
4 个月前
https://static.github-zh.com/github_avatars/trycua?size=40

A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.

1.41 k
4 个月前
OpenAdaptAI/OpenAdapt
https://static.github-zh.com/github_avatars/OpenAdaptAI?size=40

Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models

Python 1.38 k
6 个月前
https://static.github-zh.com/github_avatars/zai-org?size=40

An open-sourced end-to-end VLM-based GUI Agent

Python 1.05 k
5 个月前
https://static.github-zh.com/github_avatars/deedy?size=40

A fork of Anthropic Computer Use that you can run on Mac computers to give Claude and other AI models autonomous access to your computer.

Python 822
9 个月前
https://static.github-zh.com/github_avatars/microsoft?size=40

Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.

Python 765
5 个月前
https://static.github-zh.com/github_avatars/instavm?size=40

A framework to enable autonomous android and computer use using any LLM (local or remote)

Python 499
2 个月前
https://static.github-zh.com/github_avatars/instavm?size=40

A framework to enable autonomous android and computer use using any LLM (local or remote)

Python 499
2 个月前
https://static.github-zh.com/github_avatars/suitedaces?size=40

Desktop app powered by Claude’s computer use capability to control your computer

Python 489
8 个月前
https://static.github-zh.com/github_avatars/baryhuang?size=40

The only general AI agent that does NOT requires extra API key, giving you full control on your local and remote MacOs from Claude Desktop App

Python 379
3 个月前
https://static.github-zh.com/github_avatars/OS-Agent-Survey?size=40

This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).

348
1 个月前
loading...
Website
Wikipedia