AgentIndex
gemini-webapi-mcp

Active · ★ 29 · AGPL-3.0 · Updated 2026-05-29
★ Trending · ★ Vision / Multimodal · ★ API Integration

An MCP server for Google Gemini that provides image generation, image editing, chat, and file analysis. It authenticates with browser cookies, so no API key is required and usage is free.

#Gemini Web API · #Image Generation · #Image Editing · #File Analysis · #Chatbot · #Cookie Authentication · #Watermark Removal · #Python
01 Features

01 Image generation from text descriptions
02 2x image resolution upscaling
03 Image editing with prompts
04 File and URL analysis (video, image, PDF)
05 Automatic watermark removal
02 Compatibility

Python · Runtime · Verified via docs
MCP · Framework · Verified via docs
Chrome · Browser · Verified via docs
03 Quick start

$ uv run --with "gemini-webapi-mcp[watermark] @ git+https://github.com/AndyShaman/gemini-webapi-mcp.git" gemini-webapi-mcp
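
MCP clients usually start a server from a configuration entry instead of a hand-typed command. The sketch below splits the quick start command into the `command` and `args` fields of an `mcpServers` entry, the JSON layout used by several MCP clients. The server name `gemini` and the helper `build_mcp_config` are hypothetical, and the listing does not say which client config file this belongs in or how the browser cookies are supplied, so treat this as a starting point and check the project documentation.

```python
import json
import shlex

# Quick start command from this listing, kept as a single string.
INSTALL_COMMAND = (
    'uv run --with "gemini-webapi-mcp[watermark] @ '
    'git+https://github.com/AndyShaman/gemini-webapi-mcp.git" gemini-webapi-mcp'
)


def build_mcp_config(command_line: str, server_name: str = "gemini") -> dict:
    """Split a shell command into an `mcpServers` entry.

    `server_name` is an arbitrary label chosen for this example; it is not
    defined by the project.
    """
    # shlex.split honours the double quotes, so the --with requirement
    # (which contains spaces) stays a single argument.
    program, *args = shlex.split(command_line)
    return {"mcpServers": {server_name: {"command": program, "args": args}}}


config = build_mcp_config(INSTALL_COMMAND)
print(json.dumps(config, indent=2))
```

The printed JSON can be merged into the client's configuration file. Keeping the `--with` requirement as one argument matters, because a client that launches the process directly will not re-parse shell quoting.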
04 Use cases

↳ Generate new images from text prompts
↳ Edit existing images by providing prompts and files
↳ Analyze various file types (video, images, PDF) or web content via URL
↳ Conduct multi-turn text conversations with Gemini
05 Alternatives

ragflow · ★ 81.5k
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs.

n8n · ★ 190.2k
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

Microsoft AutoGen · ★ 58.5k
A framework that enables the development of LLM applications using multiple agents that can converse with each other to solve tasks.

CrewAI · ★ 52.4k
Framework for orchestrating role-playing, autonomous AI agents. By working together, your Crew can tackle complex tasks.

dagster · ★ 15.6k
An orchestration platform for the development, production, and observation of data assets.

Scrapling · ★ 55.0k
🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!

ChatTTS · ★ 39.4k
A generative speech model for daily dialogue.

FastMCP · ★ 25.4k
The fast, Pythonic way to build MCP servers and clients. Designed by the Pydantic team for type safety and speed.

Comments

  • Marlowe Brown · May 24, 2026
    Image editing alongside generation covers the common creative workflow.

  • Phoenix Harris · Apr 22, 2026
    Browser-based approach has latency tradeoffs but the price point is hard to beat.

  • Rebel Jackson · Apr 11, 2026
    Free image generation via Gemini browser API without burning API credits is clever.

  • Ellis Patel · Mar 14, 2026
    Good for prototyping image generation workflows before committing to API costs.

Stats
GitHub Stars: ★ 29
Last commit: 1d ago
Status: Active
License: AGPL-3.0
Category: Vision / Multimodal
Trend (30d): +1.1 (↑ 0.7%)
© 2026 AgentIndex.app|Built by a 10-year iOS Developer.