chinese-llm-benchmark
ReLE评测:中文AI大模型能力评测(持续更新):目前已囊括335个大模型,覆盖chatgpt、gpt-5.2、o4-mini、谷歌gemini-3-pro、Claude-4.5、文心ERNIE-X1.1、ERNIE-5.0-Thinking、qwen3-max、百川、讯飞星火、商汤senseChat等商用模型, 以及kimi-k2、ernie4.5、minimax-M2、deepseek-v3.2、qwen3-2507、llama4、智谱GLM-4.6、gemma3、mistral等开源大模型。不仅提供排行榜,也提供规模超200万的大模型缺陷库!方便广大社区研究分析、改进大模型。
Compatibility Index
OpenAI (GPT series)
SupportedVerified via documentation
Google (Gemini series)
SupportedVerified via documentation
Anthropic (Claude series)
SupportedVerified via documentation
Baidu (ERNIE series)
SupportedVerified via documentation
Alibaba (Qwen series)
SupportedVerified via documentation
DeepSeek
SupportedVerified via documentation
Community Discussion
💡Top Alternatives to chinese-llm-benchmark
Why use AgentIndex?
AgentIndex is the definitive directory for AI Agents, MCP Servers, and developer tools. We curate high-quality, open-source resources to help developers build the next generation of AI applications. Whether you are looking for autonomous agents, RAG frameworks, or IDE rules for Cursor and Windsurf, AgentIndex provides a centralized, searchable database to accelerate your development workflow.
What are MCP Servers?
The Model Context Protocol (MCP) is an open standard that enables AI models to securely interact with external data and tools. MCP Servers act as bridges, allowing Large Language Models (LLMs) like Claude and Gemini to access local files, databases, and APIs without direct integration. AgentIndex lists hundreds of ready-to-use MCP servers to enhance your AI's capabilities and context awareness.

