chinese-llm-benchmark: ReLE Benchmark (formerly CLiB) provides a continuously updated evaluation for Chinese AI large language models, covering over 337 commercial and open-source LLMs. It offers multi-dimensional capability assessments across various domains, along with comprehensive rankings and a large defect library for model improvement.; CopilotKit: CopilotKit is a React framework for embedding AI copilots, chatbots, and in-app agents directly into web applications. It provides UI components (chat interface, sidebar, textarea) and backend infrastructure for building agentic frontends — where AI can read and modify application state, take actions, and render generative UI. Supports LangChain, LangGraph, CrewAI, and custom agent backends.
Comparing and selecting the best performing LLMs for specific applications.
Adding a context-aware AI copilot to a SaaS product that understands app state