Hang Xu
18 papers · 0 citations
Most-cited papers
- Voxel Transformer For 3D Object Detection2021 · 459 citations
- Eyes Closed, Safety On: Protecting Multimodal Llms Via Image-to-text Transformation2024 · 123 citations
- Mixture Of Cluster-conditional Lora Experts For Vision-language Instruction Tuning2023 · 109 citations
- Navcot: Boosting Llm-based Vision-and-language Navigation Via Learning Disentangled Reasoning2024 · 99 citations
- Reuse And Diffuse: Iterative Denoising For Text-to-video Generation2023 · 59 citations
- Detclipv3: Towards Versatile Generative Open-vocabulary Object Detection2024 · 55 citations
- Any-size-diffusion: Toward Efficient Text-driven Synthesis For Any-size HD Images2023 · 19 citations
- Generative Negative Text Replay For Continual Vision-language Pretraining2022 · 13 citations
- Explicitly Guided Information Interaction Network For Cross-modal Point Cloud Completion2024 · 12 citations
- Self-adaptive Reality-guided Diffusion For Artifact-free Super-resolution2024 · 12 citations
- Thinking With Geometry: Active Geometry Integration For Spatial Reasoning2026
- Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection2025
- C2-evo: Co-evolving Multimodal Data And Model For Self-improving Reasoning2025
- ECCV 2024 W-CODA: 1st Workshop On Multimodal Perception And Comprehension Of Corner Cases In Autonomous Driving2025
- Percept-wam: Perception-enhanced World-awareness-action Model For Robust End-to-end Autonomous Driving2025
Topics