dunetrace: Dunetrace monitors every run of AI agents in real-time, detecting structural failures like tool loops, retry storms, and context bloat within 15 seconds of completion. It fires alerts via Slack or webhook, provides plain-English explanations, and offers one-click fixes through Langfuse or GitHub PRs.; Auto-claude-code-research-in-sleep: Auto-claude-code-research-in-sleep (ARIS) is a set of custom Claude Code skills for autonomous ML research workflows. It orchestrates cross-model collaboration, with Claude Code executing research tasks and an external LLM (like GPT-5.4) critically reviewing. This system can autonomously discover ideas, run experiments, and write/refine research papers, allowing researchers to wake up to ready-to-submit results.
Monitor production AI agents for silent failures like tool loops and cost spikes
Explore new research areas and discover novel ideas through literature surveys and brainstorming.