AI Research Directory
mb

MLE-bench

ML engineer benchmark (OpenAI)

Open-sourceResearch Agents & AI Scientists

What it is

OpenAI's benchmark for ML engineering agents — end-to-end modeling on real Kaggle competitions.

RELATED · SAME CATEGORY