ragflow: RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that integrates RAG with Agent capabilities. It provides a superior context layer for LLMs and offers a streamlined RAG workflow adaptable to enterprises of any scale.; kreuzberg: Kreuzberg is a high-performance, polyglot library designed to extract text and metadata from over 57 file formats, including comprehensive OCR capabilities. Built with a Rust core, it offers native speed processing, memory efficiency, and the ability to generate embeddings without requiring a GPU, making it highly versatile for various data extraction and processing tasks.
Building high-fidelity, production-ready AI systems with complex data.
Automated extraction of text, metadata, and structured data from diverse document types.