Awesome Papers
LLMsQuantumSimSearchAI4CodeAgentsCVRoboticsCyberAI4SciSpeechRLMMGenAIGraphTSRecSysFL

← authors · overview

Yansong Tang

16 papers · 5 citations
Most-cited papers
  • LAVT: Language-aware Vision Transformer For Referring Image Segmentation
    2021 · 335 citations
  • SAM2-LOVE: Segment Anything Model 2 In Language-aided Audio-visual Scenes
    2025 · 4 citations
  • Flash-vstream: Efficient Real-time Understanding For Long Video Streams
    2025
  • Meta-cot: Enhancing Granularity And Generalization In Image Editing
    2026
Topics
Vision-Language ModelsVideo-LanguageSegmentationVisual Language3D VisionImage GenerationImage RestorationAudio-VisualVisual QA & Reasoning

Privacy · Terms

© 2026 Awesome Papers.