MMMU-Pro
Emerging12papers using it
2024first seen
Papers using MMMU-Pro (12)
- Xiaomi Mimo-vl-miloco Technical ReportVLM-RobustBench: A Comprehensive Benchmark for Robustness of Vision-Language ModelsVision Verification Enhanced Fusion of VLMs for Efficient Visual ReasoningVOLD: Reasoning Transfer from LLMs to Vision-Language Models via On-Policy DistillationSocratic-MCTS: Test-Time Visual Reasoning by Asking the Right QuestionsBelieving Without Seeing: Quality Scores For Contextualizing Vision-language Model ExplanationsJmmmu-pro: Image-based Japanese Multi-discipline Multimodal Understanding Benchmark Via Vibe Benchmark ConstructionSRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement LearningChain-of-Description: What I can understand, I can put into wordsMMMU-Pro: A More Robust Multi-discipline Multimodal Understanding BenchmarkVL-RewardBench: A Challenging Benchmark for Vision-Language Generative Reward ModelsMAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale