conductor: Conductor is a Netflix-developed platform for orchestrating complex workflows across microservices; workflows can be defined in JSON or in code. Netflix ended its official open-source maintenance on December 13, 2023, and encouraged the community to fork the project and continue development.; AgentBench: AgentBench is a comprehensive benchmark for evaluating Large Language Models (LLMs) as agents across diverse environments, and it now includes a function-calling version integrated with AgentRL. It provides a containerized setup for tasks such as OS interaction, database operations, and web shopping, which makes agent evaluation robust and reproducible.
Coordinating complex business processes involving multiple microservices.
Systematically benchmarking the performance of various LLM-based agents.