ragflow: RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that integrates RAG with Agent capabilities. It provides a superior context layer for LLMs and offers a streamlined RAG workflow adaptable to enterprises of any scale.; pdf-mcp: pdf-mcp is a Model Context Protocol (MCP) server that enables AI agents to read, search, and extract content from PDF files. It uses PyMuPDF for PDF parsing, SQLite for persistent caching, and supports hybrid search combining BM25 keyword and semantic embeddings, OCR for scanned documents, and structured extraction of tables and images.
Building high-fidelity, production-ready AI systems with complex data.
Efficiently read and analyze large PDF documents without exceeding context limits