Awesome Speech Audio
πŸ“„Papers🧭TopicsπŸ‘₯AuthorsπŸ”₯TrendingπŸ—ΊοΈMapπŸ†LeaderboardsπŸ“šPacksπŸ› οΈToolsπŸ“BlogsπŸ€–Ask AIβœ‰οΈNewsletterπŸš€Pro
+ Add Paper

← authors Β· overview

Tong Wang

10 papers Β· 1 citations
Most-cited papers
  • MSSDF: Modality-shared Self-supervised Distillation For High-resolution Multi-modal Remote Sensing Image Learning
    2025 Β· 1 citations
  • Think Before You Segment: An Object-aware Reasoning Agent For Referring Audio-visual Segmentation
    2025
  • Composed Object Retrieval: Object-level Retrieval Via Composed Expressions
    2025
  • See Further, Think Deeper: Advancing Vlm's Reasoning Ability With Low-level Visual Cues And Reflection
    2026
Topics
Visual QA & ReasoningVision-Language ModelsUncategorizedAudio-VisualInstruction TuningEmbodied & AgentsImage-Text RetrievalBenchmarks

Stay Updated

E-Mail Digest

Submit a paper Β· Privacy Β· Terms

Β© 2026 Awesome Papers.