Pan Zhang
12 papers · 2015 citations
Most-cited papers
- Are We On The Right Way For Evaluating Large Vision-language Models? · 2024 · 736 citations
- InternLM2 Technical Report · 2024 · 378 citations
- InternLM-XComposer2: Mastering Free-form Text-image Composition And Comprehension In Vision-language Large Model · 2024 · 372 citations
- ShareGPT4V: Improving Large Multi-modal Models With Better Captions · 2023 · 237 citations
- Cross-domain Correspondence Learning For Exemplar-based Image Translation · 2020 · 218 citations
- InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-contextual Input And Output · 2024 · 192 citations
- Streaming Long Video Understanding With Large Language Models · 2024 · 158 citations
- RAR: Retrieving And Ranking Augmented MLLMs For Visual Recognition · 2024 · 2 citations
- MMDU: A Multi-turn Multi-image Dialog Understanding Benchmark And Instruction-tuning Dataset For LVLMs · 2024 · 1 citation
Topics