chinese-llm-benchmark: ReLE Benchmark (formerly CLiB) provides a continuously updated evaluation for Chinese AI large language models, covering over 337 commercial and open-source LLMs. It offers multi-dimensional capability assessments across various domains, along with comprehensive rankings and a large defect library for model improvement.; mindsdb: MindsDB is an open-source server that empowers AI, agents, and applications to obtain accurate answers from diverse, large-scale data sources. It features a robust architecture to connect and unify data from databases, data warehouses, and SaaS applications, and then respond to queries using built-in AI agents and its Model Context Protocol (MCP).
Comparing and selecting the best performing LLMs for specific applications.
Enabling AI-powered question-answering over diverse, large-scale enterprise data.