Awesome Similarity Search

Yezhou Yang

16 papers · 0 citations
Most-cited papers
  • Injecting Semantic Concepts Into End-to-end Image Captioning
    2021 · 113 citations
  • Modularized Textual Grounding For Counterfactual Resilience
    2019 · 17 citations
  • Getting It Right: Improving Spatial Consistency In Text-to-image Models
    2024 · 11 citations
  • On The Robustness Of Language Guidance For Low-level Vision Tasks: Findings From Depth Estimation
    2024 · 6 citations
  • REVISION: Rendering Tools Enable Spatial Fidelity In Vision-language Models
    2024 · 3 citations
  • Interact-video: Reasoning-rich Video QA For Urban Traffic
    2025
  • Vibetoken: Scaling 1D Image Tokenizers And Autoregressive Models For Dynamic Resolution Generations
    2026
  • Sepose: A Synthetic Event-based Human Pose Estimation Dataset For Pedestrian Monitoring
    2025
Topics
  • Visual Language
  • Image Generation
  • 3D Vision
  • Object Detection
  • Visual QA & Reasoning
  • Benchmarks
  • Video-Language
  • cs.AI
  • cs.LG
  • cs.MM

© 2026 Awesome Papers.