Mike Zheng Shou
30 papers ยท 3 citations
Most-cited papers
- Hallucination Of Multimodal Large Language Models: A Survey2024 ยท 364 citations
- Show-1: Marrying Pixel And Latent Diffusion Models For Text-to-video Generation2023 ยท 334 citations
- Unified Transformer Tracker For Object Tracking2022 ยท 117 citations
- Egovlpv2: Egocentric Video-language Pre-training With Fusion In The Backbone2023 ยท 87 citations
- CVPR 2023 Text Guided Video Editing Competition2023 ยท 58 citations
- Generic Event Boundary Detection: A Benchmark For Event Segmentation2021 ยท 54 citations
- Object-aware Video-language Pre-training For Retrieval2021 ยท 46 citations
- Paragraph-to-image Generation With Information-enriched Diffusion Model2023 ยท 43 citations
- ASSISTGUI: Task-oriented Desktop Graphical User Interface Automation2023 ยท 41 citations
- Stprivacy: Spatio-temporal Privacy-preserving Action Recognition2023 ยท 28 citations
- Position-guided Text Prompt For Vision-language Pre-training2022 ยท 28 citations
- Towards Fast Adaptation Of Pretrained Contrastive Models For Multi-channel Video-language Retrieval2022 ยท 9 citations
- SAM-I2V: Upgrading SAM To Support Promptable Video Segmentation With Less Than 0.2% Training Cost2025 ยท 3 citations
- Darwinian Model Upgrades: Model Evolving With Selective Compatibility2022 ยท 1 citations
- Paper2video: Automatic Video Generation From Scientific Papers2025
Topics