vmlx: vMLX is a local AI inference engine for Apple Silicon Macs that runs LLMs, VLMs, and image generation models. It provides OpenAI and Anthropic compatible APIs with advanced features like continuous batching, prefix caching, KV cache quantization, speculative decoding, and tool calling. No cloud or API keys required, ensuring data privacy.; semble: Semble is a high-performance code search library designed for AI agents, providing instant access to precise code snippets. It offers significantly faster indexing and querying compared to transformer models, achieving 99% of their retrieval quality while running entirely on CPU without external dependencies.
Run local AI assistants with chatbot and agentic coding capabilities
Enhancing AI agents (e.g., Claude Code, Cursor, Codex) with fast and accurate code search capabilities