agents-towards-production: Agents Towards Production is an open-source playbook for building production-ready GenAI agents that scale from prototype to enterprise. It provides hands-on tutorials covering a comprehensive range of topics from orchestration and memory to deployment and security.; chinese-llm-benchmark: ReLE Benchmark (formerly CLiB) provides a continuously updated evaluation for Chinese AI large language models, covering over 337 commercial and open-source LLMs. It offers multi-dimensional capability assessments across various domains, along with comprehensive rankings and a large defect library for model improvement.
Developing scalable, production-ready Generative AI agents.
Comparing and selecting the best performing LLMs for specific applications.