Awesome Speech Audio
📄Papers🧭Topics👥Authors🔥Trending🗺️Map🏆Leaderboards📚Packs🛠️Tools📝Blogs🤖Ask AI✉️Newsletter🚀Pro
+ Add Paper

← authors · overview

Yong Zhang

16 papers · 685 citations
Most-cited papers
  • Evalcrafter: Benchmarking And Evaluating Large Video Generation Models
    2023 · 281 citations
  • Superfiltering: Weak-to-strong Data Filtering For Fast Instruction-tuning
    2024 · 134 citations
  • E4srec: An Elegant Effective Efficient Extensible Solution Of Large Language Models For Sequential Recommendation
    2023 · 77 citations
  • OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models
    2024 · 77 citations
  • Tencent Ml-images: A Large-scale Multi-label Image Database For Visual Representation Learning
    2019 · 57 citations
  • PRCA: Fitting Black-box Large Language Models For Retrieval Question Answering Via Pluggable Reward-driven Contextual Adapter
    2023 · 49 citations
  • Exact Adversarial Attack To Image Captioning Via Structured Output Learning With Latent Variables
    2019 · 42 citations
  • On The Cultural Gap In Text-to-image Generation
    2023 · 10 citations
  • Ditctrl: Exploring Attention Control In Multi-modal Diffusion Transformer For Tuning-free Multi-prompt Longer Video Generation
    2024 · 8 citations
  • Divprune: Diversity-based Visual Token Pruning For Large Multimodal Models
    2025 · 6 citations
Topics
Fine-TuningModel ArchitectureImage GenerationEfficiencyTraining TechniquesVisual LanguageEvaluationObject DetectionRAGReinforcement Learning

Stay Updated

E-Mail Digest

Submit a paper · Privacy · Terms

© 2026 Awesome Papers.