Awesome Speech Audio
πŸ“„Papers🧭TopicsπŸ‘₯AuthorsπŸ”₯TrendingπŸ—ΊοΈMapπŸ†LeaderboardsπŸ“šPacksπŸ› οΈToolsπŸ“BlogsπŸ€–Ask AIβœ‰οΈNewsletterπŸš€Pro
+ Add Paper

← authors Β· overview

Yan Zhou

10 papers Β· 5 citations
Most-cited papers
  • Dt-nerf: A Diffusion And Transformer-based Optimization Approach For Neural Radiance Fields In 3D Reconstruction
    2025 Β· 4 citations
  • Diffcap: Diffusion-based Real-time Human Motion Capture Using Sparse Imus And A Monocular Camera
    2025 Β· 1 citations
  • Can Multimodal Large Language Models Understand Spatial Relations?
    2025
  • MIDAS: Multimodal Interactive Digital-human Synthesis Via Real-time Autoregressive Video Generation
    2025
Topics
UncategorizedVision-Language ModelsBenchmarksVideo-LanguageAudio-Visual

Stay Updated

E-Mail Digest

Submit a paper Β· Privacy Β· Terms

Β© 2026 Awesome Papers.