ragflow: RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that integrates RAG with Agent capabilities. It provides a superior context layer for LLMs and offers a streamlined RAG workflow adaptable to enterprises of any scale.; groundingLMM: GLaMM (Grounding Large Multimodal Model) is an end-to-end trained LMM capable of generating natural language responses integrated with object segmentation masks, enabling visual grounding and versatile interaction with images at multiple granularity levels. It introduces the novel task of Grounded Conversation Generation (GCG), supports various downstream applications like referring expression segmentation and region-level captioning, and is underpinned by the large-scale GranD dataset.
Building high-fidelity, production-ready AI systems with complex data.
Interactive visual assistants that understand and respond to user queries about specific image regions.