FunASR: FunASR is a fundamental end-to-end speech recognition toolkit. It offers industrial-grade speech recognition, being 170x faster than Whisper, supporting over 50 languages, and integrating features like speaker diarization, emotion detection, and streaming.; apitap: ApiTap is an MCP server that lets AI agents browse the web through APIs instead of browsers, automatically detecting a site's framework and discovering its internal API endpoints. It generates reusable skill files for direct API calls, reducing token costs by 20-100x compared to browser automation.
Meeting transcription with speaker labels, timestamps, and punctuation
AI agents can gather data from websites without using a browser