AgentIndex icon
AgentIndex
ToolsCategoriesTrendingNewCompare
Submit Tool
ToolsCategoriesTrendingNewCompare
Home/
Compare/
FunASR vs worldlabs-mcp
FunASR logo
FunASR
★ 16.6k
vs
worldlabs-mcp logo
worldlabs-mcp
★ 13

FunASR vs worldlabs-mcp

FunASR: FunASR is a fundamental end-to-end speech recognition toolkit. It offers industrial-grade speech recognition, being 170x faster than Whisper, supporting over 50 languages, and integrating features like speaker diarization, emotion detection, and streaming.; worldlabs-mcp: worldlabs-mcp is a Model Context Protocol gateway to World Labs' Marble and Spark 2.0 engines. It enables generation of navigable 3D worlds from diverse inputs (text, image, video, panorama), real-time streaming via Gaussian splat rendering, and spatial voice agent integration. The project includes a web dashboard, multiple export targets (Resonite, Blender, Unity), and VR headset support.

01

TL;DR

FunASR logoChoose FunASR if…

Meeting transcription with speaker labels, timestamps, and punctuation

worldlabs-mcp logoChoose worldlabs-mcp if…

Generate navigable 3D worlds for VR/AR experiences from text or images

02

Side-by-Side Comparison

Field
FunASR logoFunASR
worldlabs-mcp logoworldlabs-mcp
Category
Voice / Speech
Vision / Multimodal
Stars
★ 16.6k
★ 13
License
MIT
—
Updated
1d ago
3d ago
Open Source
Yes
Yes
Website
↗ Visit
↗ Visit
GitHub
↗ GitHub
↗ GitHub
Tags
asr, audio, chinese
marble, mcp-server, mcp-servers
03

Features

FunASR logoFunASR
01Extremely fast (170x faster than Whisper)
02Supports 50+ languages
03Built-in Speaker Diarization
04Emotion Detection
05Streaming ASR and vLLM Acceleration
worldlabs-mcp logoworldlabs-mcp
01Marble 1.1+ world generation from text, image, multi-image, video, or local file upload
02Spark 2.0 Spatial Engine with hierarchical LoD and virtual GPU paging for Gaussian-splat streaming
03World Library with card/list view, thumbnails, search, filters, and asset downloads
04Painting Portals: generate 3D worlds from famous paintings
05Spatial Voice Agent with built-in TTS and coordinate-grounded narration
04

Use Cases

FunASR logoFunASR
↳Meeting transcription with speaker labels, timestamps, and punctuation
↳Deployment as an OpenAI-compatible API server
↳Integration with AI agents (e.g., Claude, LangChain, Dify, AutoGen)
worldlabs-mcp logoworldlabs-mcp
↳Generate navigable 3D worlds for VR/AR experiences from text or images
↳Transform famous paintings into immersive 3D scenes for virtual galleries
↳Create interactive spatial narratives with voice agents grounded in scene coordinates
05

Best For

FunASR logoFunASR
Most PopularVoice / SpeechLLM Infra
worldlabs-mcp logoworldlabs-mcp
TrendingVision / MultimodalAPI Integration
FAQ

FAQ

What is the difference between FunASR and worldlabs-mcp?
Both FunASR and worldlabs-mcp are in the Voice / Speech category. FunASR has 16.6k stars, while worldlabs-mcp has 13 stars.
Which is better, FunASR or worldlabs-mcp?
The best choice depends on your use case. Choose FunASR if Meeting transcription with speaker labels, timestamps, and punctuation, and worldlabs-mcp if Generate navigable 3D worlds for VR/AR experiences from text or images.
Is FunASR free or open source?
Yes, FunASR is open source on GitHub (MIT).
Is worldlabs-mcp free or open source?
Yes, worldlabs-mcp is open source on GitHub.
→

Related

Alternatives to FunASR →Alternatives to worldlabs-mcp →FunASR details →worldlabs-mcp details →OpenClaw vs FunASR →
© 2026 AgentIndex.app|Built by a 10-year iOS Developer.
QYSGitHubBuy me a coffee ☕

Browse by Category

Code AssistantWorkflow AutomationRAG / Knowledge BaseMulti-AgentBrowser AutomationLLM InfraDev ToolingObservability

Not affiliated with Anthropic, OpenAI or Microsoft.