Awesome Speech Audio
📄Papers🧭Topics👥Authors🔥Trending🗺️Map🏆Leaderboards📚Packs🛠️Tools📝Blogs🤖Ask AI✉️Newsletter🚀Pro
+ Add Paper

← authors · overview

Xu Yang

14 papers · 34 citations
Most-cited papers
  • Weakly Aligned Feature Fusion For Multimodal Object Detection
    2022 · 77 citations
  • Deconfounded Image Captioning: A Causal Retrospect
    2020 · 60 citations
  • Soccernet 2025 Challenges Results
    2025 · 33 citations
  • Auto-parsing Network For Image Captioning And Visual Question Answering
    2021 · 33 citations
  • Texture-preserving Diffusion Models For High-fidelity Virtual Try-on
    2024 · 33 citations
  • Unseen Object Instance Segmentation With Fully Test-time RGB-D Embeddings Adaptation
    2022 · 9 citations
  • ERF-BA-TFD+: A Multimodal Model For Audio-visual Deepfake Detection
    2025 · 1 citations
  • Agent^2 Rl-bench: Can LLM Agents Engineer Agentic RL Post-training?
    2026
  • Loupe: A Generalizable And Adaptive Framework For Image Forgery Detection
    2025
  • Extracting Multimodal Learngene In CLIP: Unveiling The Multimodal Generalizable Knowledge
    2025
  • Unleashing Vision Foundation Models For Coronary Artery Segmentation: Parallel Vit-cnn Encoding And Variational Fusion
    2025
  • One Request, Multiple Experts: LLM Orchestrates Domain Specific Models Via Adaptive Task Routing
    2025
Topics
BenchmarksObject DetectionImage GenerationVisual LanguageMulti-AgentVision-Language ModelsCode AgentsSegmentation3D VisionUncategorized

Stay Updated

E-Mail Digest

Submit a paper · Privacy · Terms

© 2026 Awesome Papers.