← all datasets

MMstar

Emerging
2 papers using it
21,187 HF downloads
52 HF likes
2025 first seen

MMStar (Are We on the Right Way for Evaluating Large Vision-Language Models?)
🌐 Homepage | 🤗 Dataset | 🤗 Paper | 📖 arXiv | GitHub

Dataset Details

As shown in the figure below, existing benchmarks lack consideration of the vision dependency of evaluation samples and potential data leakage from LLMs' and LVLMs' train

Papers using MMstar (2)
