Screenhand
Give AI eyes and hands on your desktop. Open-source MCP server for desktop automation — screenshots, UI control, browser automation, OCR. Works with Claude, Cursor, and any MCP client. macOS + Windows.
ScreenHand is an open-source MCP server providing native desktop control for AI agents on macOS and Windows. It integrates Accessibility APIs, UI Automation, OCR, and Chrome DevTools Protocol to enable fast, robust interaction with applications and browsers, including multi-agent coordination and background job processing.
Comments
- Jamie Martinez, May 22, 2026
More reliable than screenshot-based approaches for detecting UI state changes
- Jamie Zhang, Apr 2, 2026
Used for UI automation testing workflows that require actual screen interaction
- Rowan Kim, Mar 22, 2026
Giving an AI agent eyes and hands on the desktop via MCP is powerful for automation use cases
- Dylan Wilson, Mar 15, 2026
The open-source approach means you can audit exactly what the AI agent is doing on screen