Awesome Speech Audio

Wenhu Chen

13 papers · 0 citations
Most-cited papers
  • MAmmoTH: Building Math Generalist Models Through Hybrid Instruction Tuning
    2023 · 554 citations
  • MAmmoTH2: Scaling Instructions From The Web
    2024 · 163 citations
  • VideoScore: Building Automatic Metrics To Simulate Fine-grained Human Feedback For Video Generation
    2024 · 155 citations
  • Unifying Multimodal Retrieval Via Document Screenshot Embedding
    2024 · 103 citations
  • MuRAG: Multimodal Retrieval-augmented Generator For Open Question Answering Over Images And Text
    2022 · 89 citations
  • UniIR: Training And Benchmarking Universal Multimodal Information Retrievers
    2023 · 24 citations
  • Unifying Multimodal Retrieval Via Document Screenshot Embedding
    2024 · 17 citations
  • EDIS: Entity-driven Image Search Over Multimodal Web Content
    2023 · 7 citations
  • SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval
    2024 · 4 citations
  • Gemini 2.5: Pushing The Frontier With Advanced Reasoning, Multimodality, Long Context, And Next Generation Agentic Capabilities
    2025
  • VideoScore2: Think Before You Score In Generative Video Evaluation
    2025
  • VerlTool: Towards Holistic Agentic Reinforcement Learning With Tool Use
    2025
  • TUNA: Taming Unified Visual Representations For Native Unified Multimodal Models
    2025
  • VideoEval-Pro: Robust And Realistic Long Video Understanding Evaluation
    2025
Topics
Image Retrieval · Benchmarks · Video-Language · Fine-Tuning · Training Techniques · Evaluation · Efficiency · Cross-Modal Hashing · Visual QA & Reasoning · Embodied & Agents


© 2026 Awesome Papers.