ggui: ggui is an open protocol and reference runtime that allows AI agents to generate ephemeral, interactive user interfaces over MCP without writing frontend code. It can be self-hosted and paired with any MCP-aware agent runtime such as Claude Desktop, Cursor, or ChatGPT desktop.; FunASR: FunASR is a fundamental end-to-end speech recognition toolkit. It offers industrial-grade speech recognition, being 170x faster than Whisper, supporting over 50 languages, and integrating features like speaker diarization, emotion detection, and streaming.
AI agent pushes dynamic UIs to users' browsers in real-time
Meeting transcription with speaker labels, timestamps, and punctuation