Awesome Similarity Search

Yansong Tang

16 papers · 5 citations
Most-cited papers
  • LAVT: Language-aware Vision Transformer For Referring Image Segmentation
    2021 · 335 citations
  • ScalableViT: Rethinking The Context-oriented Generalization Of Vision Transformer
    2022 · 45 citations
  • Learning From Temporal Spatial Cubism For Cross-dataset Skeleton-based Action Recognition
    2022 · 20 citations
  • MADTP: Multimodal Alignment-guided Dynamic Token Pruning For Accelerating Vision-language Transformer
    2024 · 17 citations
  • ATP-LLaVA: Adaptive Token Pruning For Large Vision Language Models
    2024 · 14 citations
  • SAM2-LOVE: Segment Anything Model 2 In Language-aided Audio-visual Scenes
    2025 · 4 citations
  • FADE: Frequency-aware Diffusion Model Factorization For Video Editing
    2025 · 1 citation
  • Flash-VStream: Efficient Real-time Understanding For Long Video Streams
    2025
  • Meta-CoT: Enhancing Granularity And Generalization In Image Editing
    2026
Topics
3D Vision · Visual Language · Video-Language · Vision-Language Models · Segmentation · Image Generation · Image Restoration · Object Detection · Video Understanding · Tracking


© 2026 Awesome Papers.