GitHub - bytedance/UI-TARS-desktop: The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
2025-08-31
UI-TARS-desktop is ByteDance's open-source multimodal AI agent stack comprising Agent TARS (a CLI and Web UI for terminal/browser/computer automation) and UI-TARS Desktop (a native desktop application for local and remote computer/browser control). The system leverages cutting-edge multimodal LLMs integrated with MCP tools to enable human-like task completion through GUI agents and vision capabilities, with recent updates adding streaming support, runtime analytics, and isolated execution environments.
Was this useful?