FunASR: FunASR is a fundamental end-to-end speech recognition toolkit. It offers industrial-grade speech recognition, being 170x faster than Whisper, supporting over 50 languages, and integrating features like speaker diarization, emotion detection, and streaming.; openrouter-mcp-multimodal: The only OpenRouter MCP server with native vision, image generation, and smart image optimization. It provides access to over 300 LLMs through the Model Context Protocol, supporting multimodal workflows like image analysis, image generation, and chat. The server features zero external HTTP dependencies, lazy sharp loading, and a singleton model cache for efficient performance.
Meeting transcription with speaker labels, timestamps, and punctuation
Chat with any of 300+ models via natural language