pdf-mcp

Active·★ 45·MIT·Updated 2026-05-29
★ Trending · ★ Code Assistant · ★ RAG / Knowledge Base

MCP server that lets Claude Code and other AI agents read large PDFs without hitting context limits. Chunked reading, hybrid search, OCR, table and image extraction, SQLite cache.

pdf-mcp is a Model Context Protocol (MCP) server that enables AI agents to read, search, and extract content from PDF files. It uses PyMuPDF for PDF parsing and SQLite for persistent caching, and it supports hybrid search that combines BM25 keyword matching with semantic embeddings, OCR for scanned documents, and structured extraction of tables and images.
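As a rough illustration of what a persistent SQLite cache with automatic invalidation can look like, here is a minimal sketch. It is not pdf-mcp's actual schema or code: the table layout, the `open_cache` and `get_page` names, and the choice to invalidate on a change in file modification time or size are all assumptions made for this example. The `extract` callable stands in for whatever parser produces the page text.

```python
import os
import sqlite3


def open_cache(db_path=":memory:"):
    """Open (or create) a hypothetical page-text cache."""
    conn = sqlite3.connect(db_path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS page_text ("
        " path TEXT, page INTEGER, mtime_ns INTEGER, size INTEGER, text TEXT,"
        " PRIMARY KEY (path, page))"
    )
    return conn


def get_page(conn, path, page, extract):
    """Return cached text for one page, re-extracting if the file changed."""
    st = os.stat(path)
    row = conn.execute(
        "SELECT text FROM page_text"
        " WHERE path = ? AND page = ? AND mtime_ns = ? AND size = ?",
        (path, page, st.st_mtime_ns, st.st_size),
    ).fetchone()
    if row is not None:
        return row[0]  # cache hit: file fingerprint still matches

    # Assumed invalidation rule: purge every row for this file whose
    # stored fingerprint no longer matches the file on disk.
    conn.execute(
        "DELETE FROM page_text WHERE path = ? AND (mtime_ns != ? OR size != ?)",
        (path, st.st_mtime_ns, st.st_size),
    )
    text = extract(path, page)
    conn.execute(
        "INSERT OR REPLACE INTO page_text VALUES (?, ?, ?, ?, ?)",
        (path, page, st.st_mtime_ns, st.st_size, text),
    )
    conn.commit()
    return text
```

Keying the fingerprint on modification time plus size is cheap because it avoids hashing large files. A content hash would be stricter, at the cost of reading the whole file on every lookup.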

#agentic-ai #ai #claude #codex-cli #copilot #document-processing #llm #mcp
$ Install
$ pip install pdf-mcp
01 Features

01 Hybrid search (BM25 keyword + semantic embeddings) with Reciprocal Rank Fusion
02 Paginated reading to avoid context overflow
03 OCR support for scanned and image-based PDFs via Tesseract
04 Structured extraction of tables, images, and table of contents
05 Persistent SQLite cache with automatic invalidation
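Reciprocal Rank Fusion, named in the first feature above, merges several ranked lists by summing 1 / (k + rank) for each item across the lists. The sketch below shows the general technique only. The function name, the page identifiers, and the constant k = 60 (a commonly used default) are assumptions, and nothing here is taken from pdf-mcp's source.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse ranked lists of IDs; a higher fused score ranks earlier."""
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Sort by descending score, breaking ties by ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], str(d)))


# Hypothetical results from a keyword ranker and an embedding ranker.
bm25_hits = ["p3", "p1", "p7"]
semantic_hits = ["p1", "p9", "p3"]
fused = reciprocal_rank_fusion([bm25_hits, semantic_hits])
print(fused)  # ['p1', 'p3', 'p9', 'p7']
```

Because only ranks are used, the BM25 and embedding scores never need to be normalized onto a common scale.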
02 Compatibility

Claude Code · Verified via docs
Claude Desktop · Verified via docs
VS Code · Verified via docs
Codex CLI · Verified via docs
Kiro · Verified via docs
03 Quick start

$ pip install pdf-mcp
04 Use cases

↳Efficiently read and analyze large PDF documents without exceeding context limits
↳Search for specific content or concepts within PDFs using natural language
↳Extract structured data such as tables and images from PDFs
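The first use case depends on handing an agent a document in pieces that fit its context window. A minimal sketch of that idea follows, assuming page text has already been extracted into a list of strings. The `paginate` function and the character budget are hypothetical and do not reflect pdf-mcp's actual tool names or parameters.

```python
def paginate(pages, max_chars=8000):
    """Yield batches of consecutive (page_number, text) pairs within a budget.

    A single page longer than max_chars is still yielded, alone in its batch.
    """
    batch, used = [], 0
    for number, text in enumerate(pages, start=1):
        if batch and used + len(text) > max_chars:
            yield batch
            batch, used = [], 0
        batch.append((number, text))
        used += len(text)
    if batch:
        yield batch


# Toy page texts with a deliberately small budget.
pages = ["a" * 5, "b" * 5, "c" * 12, "d" * 3]
for batch in paginate(pages, max_chars=10):
    print([number for number, _ in batch])
```

With these toy inputs the batches contain pages [1, 2], then [3], then [4]. A real server would more likely budget in tokens than characters.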
05 Alternatives

ragflow ★ 81.5k
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

n8n ★ 190.2k
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

fastmcp ★ 25.4k
🚀 The fast, Pythonic way to build MCP servers and clients.

nuclear ★ 17.7k
Streaming music player that finds free music for you

context-mode ★ 16.0k
Context window optimization for AI coding agents. Sandboxes tool output, 98% reduction. 12 platforms

Auto-claude-code-research-in-sleep ★ 11.0k
ARIS ⚔️ (Auto-Research-In-Sleep): Claude Code skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation via Codex MCP

agents-best-practices ★ 1.1k
Provider-neutral Agent Skill for Codex, Claude Code, and agentic harness design.

holaOS ★ 5.4k
The agent environment for long-horizon work, continuity, and self-evolution.

Comments

  • Jamie Harris · May 5, 2026

    Good for research workflows where Claude needs to process many large documents efficiently

  • Quinn Kim · Apr 29, 2026

    Reading large PDFs without hitting context limits is a practical problem well solved here

  • Sage Garcia · Apr 23, 2026

    The chunking approach handles technical papers and long documents reliably

  • Spencer Zhang · Apr 14, 2026

    Used for automated literature review workflows, PDF parsing accuracy is high

Stats
GitHub Stars: ★ 45
Last commit: 1d ago
Status: Active
License: MIT
Category: Vision / Multimodal
Trend (30d): +1.8 ↑ 0.7%
© 2026 AgentIndex.app|Built by a 10-year iOS Developer.
Not affiliated with Anthropic, OpenAI or Microsoft.