conductor: Conductor is an open-source, scalable microservices orchestration engine originally built at Netflix. It empowers developers to define and manage resilient, distributed, and asynchronous workflows across various services and systems.; AgentBench: AgentBench is a comprehensive benchmark for evaluating Large Language Models (LLMs) as agents across diverse environments, now featuring a function-calling version integrated with AgentRL. It provides a containerized setup for various tasks like OS interaction, database operations, and web shopping, enabling robust and reproducible agent evaluation.
Orchestrating complex microservice interactions
Systematically benchmark the performance of various LLM-based agents.