Gemini CLI: Gemini CLI is Google's open-source AI agent for the terminal, providing direct access to Gemini models with a free tier of 60 requests/minute and 1,000/day. It ships with built-in tools for Google Search grounding, file operations, shell command execution, and web fetching, plus native MCP support for custom integrations. Built for developers who work primarily in the command line.; UI-TARS-desktop: UI-TARS Desktop is the desktop application component of the TARS multimodal AI agent stack. It provides a native GUI agent that can understand and interact with your computer's user interface by seeing the screen, running shell commands, and using browser tools. Powered by cutting-edge multimodal LLMs with MCP integration for extending agent capabilities.
Running AI-assisted development tasks directly in the terminal without switching contexts
Automating desktop GUI workflows that don't have APIs by seeing and clicking the UI