chinese-llm-benchmark: ReLE Benchmark (formerly CLiB) provides a continuously updated evaluation for Chinese AI large language models, covering over 337 commercial and open-source LLMs. It offers multi-dimensional capability assessments across various domains, along with comprehensive rankings and a large defect library for model improvement.; dagster: Dagster is a data orchestrator purpose-built for data platforms in the MLOps era, helping users define, develop, and operate data assets. It offers a powerful programming model, local development experience, and a robust UI for observing and debugging pipelines in production.
Comparing and selecting the best performing LLMs for specific applications.
Building reliable data platforms