Awesome Speech Audio

Yezhou Yang

16 papers · 0 citations
Most-cited papers
  • Injecting Semantic Concepts Into End-to-end Image Captioning
    2021 · 113 citations
  • Modularized Textual Grounding For Counterfactual Resilience
    2019 · 17 citations
  • Getting It Right: Improving Spatial Consistency In Text-to-image Models
    2024 · 11 citations
  • On The Robustness Of Language Guidance For Low-level Vision Tasks: Findings From Depth Estimation
    2024 · 6 citations
  • REVISION: Rendering Tools Enable Spatial Fidelity In Vision-language Models
    2024 · 3 citations
  • Interact-video: Reasoning-rich Video QA For Urban Traffic
    2025
  • Sepose: A Synthetic Event-based Human Pose Estimation Dataset For Pedestrian Monitoring
    2025
  • Vibetoken: Scaling 1D Image Tokenizers And Autoregressive Models For Dynamic Resolution Generations
    2026
Topics
Visual Language · Image Generation · 3D Vision · Uncategorized · Object Detection · Visual QA & Reasoning · Benchmarks · Video-Language

