kreuzberg: Kreuzberg is a high-performance, polyglot library designed to extract text and metadata from over 57 file formats, including comprehensive OCR capabilities. Built with a Rust core, it offers native speed processing, memory efficiency, and the ability to generate embeddings without requiring a GPU, making it highly versatile for various data extraction and processing tasks.; mindsdb: MindsDB is an open-source server that empowers AI, agents, and applications to obtain accurate answers from diverse, large-scale data sources. It features a robust architecture to connect and unify data from databases, data warehouses, and SaaS applications, and then respond to queries using built-in AI agents and its Model Context Protocol (MCP).
Automated extraction of text, metadata, and structured data from diverse document types.
Enabling AI-powered question-answering over diverse, large-scale enterprise data.