awesome-generative-ai: This repository is a comprehensive, curated collection of resources in Generative AI, including academic papers, tools, courses, and artworks across various domains. It's structured into sections covering topics like LLMs, image synthesis, and AI ethics, with references updated in reverse chronological order to keep users abreast of the latest developments.; groundingLMM: GLaMM (Grounding Large Multimodal Model) is an end-to-end trained LMM capable of generating natural language responses integrated with object segmentation masks, enabling visual grounding and versatile interaction with images at multiple granularity levels. It introduces the novel task of Grounded Conversation Generation (GCG), supports various downstream applications like referring expression segmentation and region-level captioning, and is underpinned by the large-scale GranD dataset.
Learning and research in Generative AI.
Interactive visual assistants that understand and respond to user queries about specific image regions.