AgentIndex icon
AgentIndex
ToolsCategoriesTrendingNewCompare
Submit Tool
Home/
Vision / Multimodal/
Grok-MCP
Grok-MCP logo

Grok-MCP

Active·★ 33·MIT·Updated 2026-05-23
★ Trending★ Vision / Multimodal★ API Integration

MCP server for xAI's Grok API with agentic tool calling, image and video generation, vision, and file support.

Grok-MCP is an MCP server for xAI's Grok API, offering advanced agentic capabilities. It supports diverse functionalities including web search, X search, code execution, multimodal interaction, and comprehensive file management.

#Grok API#AI Agent#Tool Calling#Multimodal AI#Content Generation#Vision API#File Management#Python
$ Install
$ git clone https://github.com/merterbak/Grok-MCP.git && cd Grok-MCP && uv sync
↗ Visit site★ GitHub
01

Features

01Agentic Tool Calling (Web, X, Code Execution)
02Multimodal Content Generation (Image & Video)
03Vision Capabilities for Image Analysis
04Comprehensive Files API for Document Interaction
05Stateful Conversations for Context Retention
02

Compatibility

Python
Runtime
Verified via docs
Docker
Containerization
Verified via docs
macOS
Operating System
Verified via docs
Linux
Operating System
Verified via docs
Windows
Operating System
Verified via docs
03

Quick start

1
$ git clone https://github.com/merterbak/Grok-MCP.git
2
$ cd Grok-MCP
3
$ uv sync
04

Use cases

↳Develop AI agents with external knowledge access (web, X) and computational abilities.
↳Integrate multimodal AI features like image/video creation and vision analysis into applications.
↳Build AI assistants capable of understanding, summarizing, and chatting with uploaded documents.
↳Enhance conversational AI with persistent memory and state management for continuous interactions.
05

Alternatives

ragflow logo
ragflow★ 81.5k
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
vs →
n8n logo
n8n★ 190.2k
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
vs →
Microsoft AutoGen logo
Microsoft AutoGen★ 58.5k
A framework that enables the development of LLM applications using multiple agents that can converse with each other to solve tasks.
vs →
CrewAI logo
CrewAI★ 52.4k
Framework for orchestrating role-playing, autonomous AI agents. By working together, your Crew can tackle complex tasks.
vs →
Gemini CLI logo
Gemini CLI★ 104.7k
An open-source AI agent that brings the power of Gemini directly into your terminal. Supports native MCP.
vs →
dagster logo
dagster★ 15.6k
An orchestration platform for the development, production, and observation of data assets.
vs →
Scrapling logo
Scrapling★ 55.0k
🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!
vs →
ChatTTS logo
ChatTTS★ 39.4k
A generative speech model for daily dialogue.
vs →
See all alternatives →

Related searches

Grok-MCP AlternativesBest Vision / Multimodal Tools 2026Open Source Vision / MultimodalGrok-MCP TutorialGrok-MCP Vs CompetitorsGrok APIAI AgentTool Calling

Comments

Log in to leave a comment
  • K
    Kai LewisApr 23, 2026

    Grok API access via MCP with agentic tool calling built in.

  • R
    Rebel AndersonMar 27, 2026

    Image and video generation alongside text — multi-modal from the same interface.

  • M
    Morgan GarciaMar 23, 2026

    Good for workflows that need xAI's models specifically.

  • S
    Sam KimFeb 28, 2026

    Tool calling implementation is clean, composable with other MCP servers.

On this page
01Features02Compatibility03Quick start04Use cases05Alternatives
Stats
GitHub Stars★ 33
Last commit1w ago
StatusActive
LicenseMIT
CategoryVision / Multimodal
Trend (30d)
+1.3↑ 0.7%
Links
Documentation↗Discussion↗Issues↗Releases↗

Deploy on DigitalOcean — Get $200 Free Credit

Ad
© 2026 AgentIndex.app|Built by a 10-year iOS Developer.
QYSGitHubBuy me a coffee ☕

Browse by Category

Code AssistantWorkflow AutomationRAG / Knowledge BaseMulti-AgentBrowser AutomationLLM InfraDev ToolingObservability

Not affiliated with Anthropic, OpenAI or Microsoft.