AgentIndex icon
AgentIndex
ToolsCategoriesTrendingNewCompare
Submit Tool
Home/
Vision / Multimodal/
vmlx
vmlx logo

vmlx

Active·★ 551·Apache-2.0·Updated 2026-05-24
★ LLM Infra★ Dev Tooling

vMLX - Cont Batch, Prefix, Paged, KV Cache Quant, VL - Powers MLX Studio. Image gen/edit, OpenAI/Anth

vMLX is a local AI inference engine for Apple Silicon Macs that runs LLMs, VLMs, and image generation models. It provides OpenAI and Anthropic compatible APIs with advanced features like continuous batching, prefix caching, KV cache quantization, speculative decoding, and tool calling. No cloud or API keys required, ensuring data privacy.

#anthropic-api#kvcache-compression#kvcache-optimization#kvcache-reuse#llm#lmstudio#macbook#mcp-server
$ Install
$ pip install vmlx
↗ Visit site★ GitHub
01

Features

01Continuous Batching for efficient concurrent request handling
02Prefix Cache with memory-aware and paged KV cache
03KV Cache Quantization to q4/q8 for memory savings
04Speculative Decoding for 20-90% speedup
05Tool Calling with auto-detected parsers for major model families
02

Compatibility

Apple Silicon Mac
macOS (M1/M2/M3/M4)
Verified via docs
03

Quick start

1
$ pip install vmlx
04

Use cases

↳Run local AI assistants with chatbot and agentic coding capabilities
↳Generate and edit images locally using Flux models
↳Deploy a fully local API server compatible with OpenAI and Anthropic SDKs
05

Alternatives

ragflow logo
ragflow★ 81.5k
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
vs →
n8n logo
n8n★ 190.2k
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
vs →
FunASR logo
FunASR★ 16.6k
Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.
vs →
nuclear logo
nuclear★ 17.7k
Streaming music player that finds free music for you
vs →
semble logo
semble★ 4.5k
Fast and Accurate Code Search for Agents
vs →
csharp-sdk logo
csharp-sdk★ 4.3k
The official C# SDK for Model Context Protocol servers and clients. Maintained in collaboration with Microsoft.
vs →
fast-agent logo
fast-agent★ 3.8k
Code, Build and Evaluate agents - excellent Model and Skills/MCP/ACP Support
vs →
initrunner logo
initrunner★ 38
Define AI agent roles in YAML and run them anywhere: CLI, API server, or autonomous daemon
vs →
See all alternatives →

Related searches

vmlx AlternativesBest Vision / Multimodal Tools 2026Open Source Vision / Multimodalvmlx Tutorialvmlx Vs Competitorsanthropic-apikvcache-compressionkvcache-optimization

Comments

Log in to leave a comment
  • D
    Dylan GarciaMar 31, 2026

    Setup was straightforward, seamless config and running in minutes. Integrates well with existing cont setups.

  • A
    Avery AndersonMar 12, 2026

    Used this for reliable automation — reliable under load — image gen/edit, openai/anth. The maintainers are responsive to issues.

  • E
    Emerson GarciaMar 11, 2026

    Setup was straightforward, reliable config and running in minutes — image gen/edit, openai/anth. Integrates well with existing batch setups.

  • P
    Peyton DavisFeb 28, 2026

    Used this for clean automation — reliable under load. Integrates well with existing vmlx setups.

On this page
01Features02Compatibility03Quick start04Use cases05Alternatives
Stats
GitHub Stars★ 551
Last commit6d ago
StatusActive
LicenseApache-2.0
CategoryVision / Multimodal
Trend (30d)
+22↑ 0.6%
Links
Documentation↗Discussion↗Issues↗Releases↗

Deploy on DigitalOcean — Get $200 Free Credit

Ad
© 2026 AgentIndex.app|Built by a 10-year iOS Developer.
QYSGitHubBuy me a coffee ☕

Browse by Category

Code AssistantWorkflow AutomationRAG / Knowledge BaseMulti-AgentBrowser AutomationLLM InfraDev ToolingObservability

Not affiliated with Anthropic, OpenAI or Microsoft.