Yu-Gang Jiang
37 papers · 9 citations
Most-cited papers
- M2TR: Multi-modal Multi-scale Transformers For Deepfake Detection2021 · 288 citations
- Anygpt: Unified Multimodal LLM With Discrete Sequence Modeling2024 · 238 citations
- Cross-domain Contrastive Learning For Unsupervised Domain Adaptation2021 · 149 citations
- To See Is To Believe: Prompting GPT-4V For Better Visual Instruction Tuning2023 · 145 citations
- Depth Guided Adaptive Meta-fusion Network For Few-shot Video Recognition2020 · 86 citations
- Agentgym: Evolving Large Language Model-based Agents Across Diverse Environments2024 · 83 citations
- Reuse And Diffuse: Iterative Denoising For Text-to-video Generation2023 · 59 citations
- Deepstack: Deeply Stacking Visual Tokens Is Surprisingly Simple And Effective For Lmms2024 · 47 citations
- Implicit Temporal Modeling With Learnable Alignment For Video Recognition2023 · 39 citations
- Context Perception Parallel Decoder For Scene Text Recognition2023 · 23 citations
- Mevis: A Multi-modal Dataset For Referring Motion Expression Video Segmentation2025 · 6 citations
- You Only Estimate Once: Unified, One-stage, Real-time Category-level Articulated Object 6D Pose Estimation For Robotic Grasping2025 · 2 citations
- Towards Omnimodal Expressions And Reasoning In Referring Audio-visual Segmentation2025 · 1 citations
- Agentgym: Evolving Large Language Model-based Agents Across Diverse Environments2024
- Thinking With Deltas: Incentivizing Reinforcement Learning Via Differential Visual Reasoning Policy2026
Topics