MLE-bench
ML engineer benchmark (OpenAI)
Open-source | Research Agents & AI Scientists
What it is
OpenAI's benchmark for ML engineering agents, evaluating end-to-end modeling on real Kaggle competitions.