Wei Li
47 papers · 4 citations
Most-cited papers
- Exploring The Limits Of Transfer Learning With A Unified Text-to-text Transformer · 2019 · 25314 citations
- How Far Are We To GPT-4V? Closing The Gap To Commercial Multimodal Models With Open-source Suites · 2024 · 1136 citations
- Internlm2 Technical Report · 2024 · 378 citations
- Internlm-xcomposer2: Mastering Free-form Text-image Composition And Comprehension In Vision-language Large Model · 2024 · 372 citations
- How Far Are We To GPT-4V? Closing The Gap To Commercial Multimodal Models With Open-source Suites · 2024 · 339 citations
- Internlm-xcomposer-2.5: A Versatile Large Vision Language Model Supporting Long-contextual Input And Output · 2024 · 192 citations
- Is-fusion: Instance-scene Collaborative Fusion For Multimodal 3D Object Detection · 2024 · 79 citations
- Cap2det: Learning To Amplify Weak Caption Supervision For Object Detection · 2019 · 36 citations
- Mosaicfusion: Diffusion Models As Data Augmenters For Large Vocabulary Instance Segmentation · 2023 · 19 citations
- Delving Into Out-of-distribution Detection With Vision-language Representations · 2022 · 12 citations
- Smarthome-bench: A Comprehensive Benchmark For Video Anomaly Detection In Smart Homes Using Multi-modal Large Language Models · 2025 · 2 citations
- Medground-r1: Advancing Medical Image Grounding Via Spatial-semantic Rewarded Group Relative Policy Optimization · 2025 · 1 citation
- Uavd-mamba: Deformable Token Fusion Vision Mamba For Multimodal UAV Detection · 2025 · 1 citation
- Prefill-time Intervention For Mitigating Hallucination In Large Vision-language Models · 2026
Topics