ArtifactsBench

Name: ArtifactsBench
License: cc-by-nc-4.0

Emerging

3papers using it

110HF downloads

13HF likes

2025first seen

ArtifactsBench: Bridging the Visual-Interactive Gap in LLM Code Generation Evaluation Tencent Hunyuan Team 📖 Paper • 🏠 Home Page • 💻 Code • 🏆 Leaderboard • 📜 Citation Figure 1: Automation level versus human–alignment across evaluation frameworks. The red star marks the fully manual WebDev Arena (100% human effort)

🤗 Hugging Face⚖ cc-by-nc-4.0

Papers using ArtifactsBench (3)

ArtifactNet: Detecting AI-Generated Music via Forensic Residual Physics2026

ArtifactsBench: Bridging the Visual-Interactive Gap in LLM Code Generation Evaluation2025