Yixiao Ge
10 papers · 0 citations
Most-cited papers
- Yolo-world: Real-time Open-vocabulary Object Detection2024 · 546 citations
- Self-supervising Fine-grained Region Similarities For Large-scale Image Localization2020 · 129 citations
- Bridging Video-text Retrieval With Multiple Choice Questions2022 · 111 citations
- Progressive Correspondence Pruning By Consensus Learning2021 · 86 citations
- Object-aware Video-language Pre-training For Retrieval2021 · 46 citations
- MILES: Visual BERT Pre-training With Injected Language Semantics For Video-text Retrieval2022 · 25 citations
- Atp-llava: Adaptive Token Pruning For Large Vision Language Models2024 · 14 citations
- Learning Transferable Spatiotemporal Representations From Natural Script Knowledge2022 · 3 citations
- Darwinian Model Upgrades: Model Evolving With Selective Compatibility2022 · 1 citations
- Ppllava: Varied Video Sequence Understanding With Prompt Guidance2026
- Toklip: Marry Visual Tokens To CLIP For Multimodal Comprehension And Generation2025
- Video-holmes: Can MLLM Think Like Holmes For Complex Video Reasoning?2025
- Haploomni: Unified Single Transformer For Multimodal Video Understanding And Generation2025
- DIAL: Decoupling Intent And Action Via Latent World Modeling For End-to-end VLA2026
- DIAL: Decoupling Intent And Action Via Latent World Modeling For End-to-end VLA2026
Topics