Awesome Similarity Search
πŸ“„Papers🧭TopicsπŸ‘₯AuthorsπŸ”₯TrendingπŸ—ΊοΈMapπŸ†LeaderboardsπŸ“šPacksπŸ› οΈToolsπŸ“BlogsπŸ€–Ask AIβœ‰οΈNewsletterπŸš€Pro
+ Add Paper

← authors Β· overview

Bohan Zhuang

15 papers Β· 0 citations
Most-cited papers
  • Scalable Vision Transformers With Hierarchical Pooling
    2021 Β· 115 citations
  • Automated Progressive Learning For Efficient Training Of Vision Transformers
    2022 Β· 28 citations
  • Dynamic Focus-aware Positional Queries For Semantic Segmentation
    2022 Β· 14 citations
  • Cov: Chain-of-view Prompting For Spatial Reasoning
    2026
  • Blockvid: Block Diffusion For High-quality And Consistent Minute-long Video Generation
    2025
  • Frequency-aware Autoregressive Modeling For Efficient High-resolution Image Synthesis
    2025
  • Geometrically-constrained Agent For Spatial Reasoning
    2025
  • Omnisparse: Training-aware Fine-grained Sparse Attention For Long-video Mllms
    2025
  • An Empirical Study On How Video-llms Answer Video Questions
    2025
  • Less Detail, Better Answers: Degradation-driven Prompting For VQA
    2026
Topics
Visual QA & ReasoningVision-Language ModelsUncategorizedEmbodied & AgentsVideo-Language3D VisionSegmentationBenchmarksVideo UnderstandingVisual Language

Stay Updated

E-Mail Digest

Submit a paper Β· Privacy Β· Terms

Β© 2026 Awesome Papers.