FunASR: FunASR is a fundamental end-to-end speech recognition toolkit. It offers industrial-grade speech recognition, being 170x faster than Whisper, supporting over 50 languages, and integrating features like speaker diarization, emotion detection, and streaming.; ocr-mcp: OCR-MCP is a complete AI OCR webapp and MCP server. It provides a web interface for drag-and-drop OCR, scanning, and batch processing, and a FastMCP server for agentic IDEs like Claude, Cursor, Windsurf. It supports 13 OCR engines, WIA scanner, preprocessing, and workflow pipelines.
Meeting transcription with speaker labels, timestamps, and punctuation
Integrate OCR capabilities into AI agents (Claude, Cursor)