fast-agent: A CLI-first framework for building and interacting with multimodal AI agents and workflows. It supports multiple LLM providers, structured outputs, and vision, and adds MCP feature support and live streaming of responses to the terminal.; FunASR: A fundamental end-to-end speech recognition toolkit. It offers industrial-grade speech recognition that is reportedly 170x faster than Whisper, supports over 50 languages, and integrates speaker diarization, emotion detection, and streaming.
Rapid development and testing of sophisticated multimodal AI agents and workflows.
Meeting transcription with speaker labels, timestamps, and punctuation.