FunASR: FunASR is a fundamental end-to-end speech recognition toolkit. It offers industrial-grade speech recognition, being 170x faster than Whisper, supporting over 50 languages, and integrating features like speaker diarization, emotion detection, and streaming.; plasmate: Plasmate compiles HTML into a Semantic Object Model (SOM), a structured representation that LLMs can reason about directly. It runs JavaScript via V8, supports Puppeteer via CDP, and produces output that is 10-800x smaller than raw HTML. It is purpose-built for AI agent pipelines, with MCP, Vercel AI SDK integrations, and over 60 ecosystem integrations.
Meeting transcription with speaker labels, timestamps, and punctuation
AI agent web browsing and structured data extraction