GitHub MCP Server: GitHub's official MCP Server connects AI agents directly to GitHub, enabling natural language interactions with repositories, issues, pull requests, code search, Actions workflows, and security findings. It is built for developers who want to bring GitHub context into AI assistants, from simple queries to complex multi-step agent workflows. It is available as a remote server or as a self-hosted Docker container.
AgentBench: AgentBench is a comprehensive benchmark for evaluating Large Language Models (LLMs) as agents across diverse environments, and it now includes a function-calling version integrated with AgentRL. It provides a containerized setup for tasks such as OS interaction, database operations, and web shopping, enabling robust and reproducible agent evaluation.
AI-assisted code review, issue triage, and PR management in GitHub
Systematically benchmark the performance of various LLM-based agents.