GitHub - bytedance/UI-TARS-desktop: The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

UI-TARS-desktop is ByteDance's open-source multimodal AI agent stack. It comprises two projects: Agent TARS, a CLI and Web UI for terminal, browser, and computer automation, and UI-TARS Desktop, a native desktop application for local and remote computer and browser control. The stack pairs multimodal LLMs with MCP tools so that GUI agents with vision capabilities can complete tasks in a human-like way. Recent updates add streaming support, runtime analytics, and isolated execution environments.
