Zuxuan Wu
26 papers · 2 citations
Most-cited papers
- M2TR: Multi-modal Multi-scale Transformers For Deepfake Detection · 2021 · 288 citations
- Cross-domain Contrastive Learning For Unsupervised Domain Adaptation · 2021 · 149 citations
- To See Is To Believe: Prompting GPT-4V For Better Visual Instruction Tuning · 2023 · 145 citations
- M3DETR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection With Transformers · 2021 · 133 citations
- The Regretful Agent: Heuristic-aided Navigation Through Progress Estimation · 2019 · 101 citations
- AgentGym: Evolving Large Language Model-based Agents Across Diverse Environments · 2024 · 83 citations
- Reuse And Diffuse: Iterative Denoising For Text-to-video Generation · 2023 · 59 citations
- Synthesize, Diagnose, And Optimize: Towards Fine-grained Vision-language Understanding · 2023 · 52 citations
- DeepStack: Deeply Stacking Visual Tokens Is Surprisingly Simple And Effective For LMMs · 2024 · 47 citations
- Implicit Temporal Modeling With Learnable Alignment For Video Recognition · 2023 · 39 citations
- 2D Or Not 2D? Adaptive 3D Convolution Selection For Efficient Video Recognition · 2020 · 27 citations
- DriveSuprim: Towards Precise Trajectory Selection For End-to-end Planning · 2025 · 2 citations
- AgentGym: Evolving Large Language Model-based Agents Across Diverse Environments · 2024
- Self-monitoring Navigation Agent Via Auxiliary Progress Estimation · 2019
- Thinking With Deltas: Incentivizing Reinforcement Learning Via Differential Visual Reasoning Policy · 2026
Topics