groundingLMM: GLaMM (Grounding Large Multimodal Model) is an end-to-end trained LMM capable of generating natural language responses integrated with object segmentation masks, enabling visual grounding and versatile interaction with images at multiple granularity levels. It introduces the novel task of Grounded Conversation Generation (GCG), supports various downstream applications like referring expression segmentation and region-level captioning, and is underpinned by the large-scale GranD dataset.; n8n: n8n is a secure workflow automation platform for technical teams, combining the flexibility of code with the speed of no-code. It offers 400+ integrations, native AI capabilities, and allows full control over data and deployments with its fair-code license.
Interactive visual assistants that understand and respond to user queries about specific image regions.
Automating diverse business and technical workflows.