playwright-mcp: Playwright MCP is a server that leverages Playwright's browser automation capabilities to enable Large Language Models (LLMs) to interact with web pages. It utilizes structured accessibility snapshots, allowing LLMs to process web content without relying on visual input or traditional screenshots.; UI-TARS-desktop: UI-TARS Desktop is the desktop application component of the TARS multimodal AI agent stack. It provides a native GUI agent that can understand and interact with your computer's user interface by seeing the screen, running shell commands, and using browser tools. Powered by cutting-edge multimodal LLMs with MCP integration for extending agent capabilities.
Enabling LLM-powered web browsing and task completion
Automating desktop GUI workflows that don't have APIs by seeing and clicking the UI