Ran Xu
13 papers · 0 citations
Most-cited papers
- ULIP: Learning A Unified Representation Of Language, Images, And Point Clouds For 3D Understanding2022 · 216 citations
- ULIP-2: Towards Scalable Multimodal Pre-training For 3D Understanding2023 · 88 citations
- Ehragent: Code Empowers Large Language Models For Few-shot Complex Tabular Reasoning On Electronic Health Records2024 · 87 citations
- FOFO: A Benchmark To Evaluate Llms' Format-following Capability2024 · 82 citations
- X-instructblip: A Framework For Aligning X-modal Instruction-aware Representations To Llms And Emergent Cross-modal Reasoning2023 · 79 citations
- Mapgpt: Map-guided Prompting With Adaptive Path Planning For Vision-and-language Navigation2024 · 37 citations
- Mask-free OVIS: Open-vocabulary Instance Segmentation Without Manual Mask Annotations2023 · 13 citations
- Naturalvlm: Leveraging Fine-grained Natural Language For Affordance-guided Visual Manipulation2024 · 11 citations
- Blip3-o: A Family Of Fully Open Unified Multimodal Models-architecture, Training And Dataset2025
- Blip3o-next: Next Frontier Of Native Image Generation2025
- Robotic VLA Benefits From Joint Learning With Motion Image Diffusion2025
- Scaling Agentic Reinforcement Learning For Tool-integrated Reasoning In Vlms2025
- Coact-1: Computer-using Multi-agent System With Coding Actions2025
- Dymu: Dynamic Merging And Virtual Unmerging For Efficient Vlms2025
- Engineering.ai: A Platform For Teams Of AI Engineers In Computational Design2025
Topics