MMstar
Emerging2papers using it
21,187HF downloads
52HF likes
2025first seen
MMStar (Are We on the Right Way for Evaluating Large Vision-Language Models?) π Homepage | π€ Dataset | π€ Paper | π arXiv | GitHub Dataset Details As shown in the figure below, existing benchmarks lack consideration of the vision dependency of evaluation samples and potential data leakage from LLMs' and LVLMs' train