Awesome Speech Audio
📄Papers🧭Topics👥Authors🔥Trending🗺️Map🏆Leaderboards📚Packs🛠️Tools📝Blogs🤖Ask AI✉️Newsletter🚀Pro
+ Add Paper

← authors · overview

Tao Zhang

28 papers · 0 citations
Most-cited papers
  • Omg-llava: Bridging Image-level, Object-level, Pixel-level Reasoning And Understanding
    2024 · 150 citations
  • Cfbench: A Comprehensive Constraints-following Benchmark For Llms
    2024 · 51 citations
  • Knowledge Enhanced Multi-intent Transformer Network For Recommendation
    2024 · 31 citations
  • Cof-cot: Enhancing Large Language Models With Coarse-to-fine Chain-of-thought Prompting For Multi-domain NLU Tasks
    2023 · 24 citations
  • Omg-llava: Bridging Image-level, Object-level, Pixel-level Reasoning And Understanding
    2024 · 7 citations
  • Hunyuanimage 3.0 Technical Report
    2025
  • Hunyuanimage 3.0 Technical Report
    2025
  • The 1st Solution For 7th LSVOS RVOS Track: Sasasa2va
    2025
  • The 1st Solution For 7th LSVOS RVOS Track: Sasasa2va
    2025
  • Swe-world: Building Software Engineering Agents In Docker-free Environments
    2026
  • Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation And Methodology
    2025
  • The Determinism Of Randomness: Latent Space Degeneracy In Diffusion Model
    2026
  • Using Large Language Models For Embodied Planning Introduces Systematic Safety Risks
    2026
  • Samtok: Representing Any Mask With Two Words
    2026
  • HAWK: Head Importance-aware Visual Token Pruning In Multimodal Models
    2026
Topics
Model ArchitectureTraining TechniquesVision-Language ModelsVideo UnderstandingRAGEvaluationIn-Context LearningImage GenerationSegmentationVisual Language

Stay Updated

E-Mail Digest

Submit a paper · Privacy · Terms

© 2026 Awesome Papers.