MLE-Bench
Emerging · 6 papers using it
First seen: 2025
Papers using MLE-Bench (6)
- MARS: Modular Agent with Reflective Search for Automated AI Research
- AIBuildAI: An AI Agent for Automatically Building AI Models
- iML: Executable, Problem-Grounded, and Broadly Exploratory Code-Driven AutoML
- Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering
- ArchPilot: A Proxy-Guided Multi-Agent Approach for Machine Learning Engineering
- ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning