Shanghang Zhang
33 papers · 3 citations
Most-cited papers
- PointCLIP V2: Prompting CLIP And GPT For Powerful 3D Open-world Learning · 2022 · 158 citations
- Draw-and-understand: Leveraging Visual Prompts To Enable MLLMs To Comprehend What You Want · 2024 · 100 citations
- MSINet: Twins Contrastive Search Of Multi-scale Interaction For Object ReID · 2023 · 89 citations
- Unsupervised Domain Adaptive 3D Detection With Multi-level Consistency · 2021 · 75 citations
- MTTrans: Cross-domain Object Detection With Mean-teacher Transformer · 2022 · 49 citations
- LLM As Dataset Analyst: Subpopulation Structure Discovery With Large Language Model · 2024 · 48 citations
- FreeKD: Knowledge Distillation Via Semantic Frequency Prompt · 2023 · 47 citations
- COLE: A Hierarchical Generation Framework For Multi-layered And Editable Graphic Design · 2023 · 32 citations
- Cloud-device Collaborative Learning For Multimodal Large Language Models · 2023 · 15 citations
- BEVUDA++: Geometric-aware Unsupervised Domain Adaptation For Multi-view 3D Object Detection · 2025 · 3 citations
- MathSticks: A Benchmark For Visual Symbolic Compositional Reasoning With Matchstick Puzzles · 2025
- FastInit: Fast Noise Initialization For Temporally Consistent Video Generation · 2025
- UniEdit-I: Training-free Image Editing For Unified VLM Via Iterative Understanding, Editing And Verifying · 2025
- PhysicsMind: Sim And Real Mechanics Benchmarking For Physical Reasoning And Prediction In Foundational VLMs And World Models · 2026
Topics