Awesome Papers
LLMs
Quantum
SimSearch
AI4Code
Agents
CV
Robotics
Cyber
AI4Sci
Speech
RL
MM
GenAI
Graph
TS
RecSys
FL
☾
☀
← authors
·
overview
Pengfei Wan
57
papers ·
1
citations
Most-cited papers
Diadem: Advancing Dialogue Descriptions In Audiovisual Video Captioning For Multimodal Large Language Models
2026
Topics
Audio-Visual
Vision-Language Models
Video-Language
Benchmarks
🤖
Ask AI