Awesome Speech Audio

Gang Yu

20 papers · 0 citations
Most-cited papers
  • TopFormer: Token Pyramid Transformer For Mobile Semantic Segmentation
    2022 · 287 citations
  • Context Prior For Scene Segmentation
    2020 · 240 citations
  • Metric3Dv2: A Versatile Monocular Geometric Foundation Model For Zero-shot Metric Depth And Surface Normal Estimation
    2024 · 201 citations
  • Scene Text Detection With Supervised Pyramid Context Network
    2018 · 161 citations
  • End-to-end 3D Dense Captioning With Vote2Cap-DETR
    2023 · 49 citations
  • Vision Foundation Models As Effective Visual Tokenizers For Autoregressive Image Generation
    2025
  • Native 3D Editing With Full Attention
    2025
  • OneIG-Bench: Omni-dimensional Nuanced Evaluation For Image Generation
    2025
  • RegionE: Adaptive Region-aware Generation For Efficient Image Editing
    2025
  • Sparse-VDiT: Unleashing The Power Of Sparse Attention To Accelerate Video Diffusion Transformers
    2025
Topics
  • Segmentation
  • 3D Vision
  • Object Detection
  • Vision-Language Models
  • Benchmarks
  • Visual QA & Reasoning
  • Uncategorized
  • Visual Language
  • Instruction Tuning
