MLAgentBench

ML research agent benchmark

Open-sourceResearch Agents & AI Scientists

What it is

Benchmark suite for ML research agents — realistic tasks like hyperparameter tuning and architecture search.

Collaborative AI agent research lab

Autonomous ML research agent

Automated research workflow engine