Awesome Speech Audio
📄Papers🧭Topics👥Authors🔥Trending🗺️Map🏆Leaderboards📚Packs🛠️Tools📝Blogs🤖Ask AI✉️Newsletter🚀Pro
+ Add Paper

← authors · overview

Bo Zhao

16 papers · 0 citations
Most-cited papers
  • Image Generation From Layout
    2018 · 145 citations
  • Rag-driver: Generalisable Driving Explanations With Retrieval-augmented In-context Learning In Multi-modal Large Language Model
    2024 · 97 citations
  • Unveiling The Ignorance Of Mllms: Seeing Clearly, Answering Incorrectly
    2024 · 28 citations
  • Omni6dpose: A Benchmark And Model For Universal 6D Object Pose Estimation And Tracking
    2024 · 21 citations
  • Enhancing Long Video Understanding Via Hierarchical Event-based Memory
    2024 · 16 citations
  • Synartifact: Classifying And Alleviating Artifacts In Synthetic Images Via Vision-language Model
    2024 · 14 citations
  • Tele-flm Technical Report
    2024 · 11 citations
  • Megapairs: Massive Data Synthesis For Universal Multimodal Retrieval
    2024 · 2 citations
  • Attribute-guided Image Generation From Layout
    2020 · 1 citations
  • Hidream-i1: A High-efficient Image Generative Foundation Model With Sparse Diffusion Transformer
    2025
  • Robofac: A Comprehensive Framework For Robotic Failure Analysis And Correction
    2025
  • Egograsp: World-space Hand-object Interaction Estimation From Egocentric Videos
    2026
  • Probing Visual Planning In Image Editing Models
    2026
Topics
Vision-LanguageImage GenerationModel ArchitectureImage RestorationTraining TechniquesEvaluationObject Detection3D VisionVision-Language ModelsBenchmarks

Stay Updated

E-Mail Digest

Submit a paper · Privacy · Terms

© 2026 Awesome Papers.