Awesome Speech Audio
📄Papers🧭Topics👥Authors🔥Trending🗺️Map🏆Leaderboards📚Packs🛠️Tools📝Blogs🤖Ask AI✉️Newsletter🚀Pro
+ Add Paper

← authors · overview

Mubarak Shah

20 papers · 1 citations
Most-cited papers
  • OW-DETR: Open-world Detection Transformer
    2021 · 202 citations
  • Cross-view Image Matching For Geo-localization In Urban Environments
    2017 · 193 citations
  • Deep Affinity Network For Multiple Object Tracking
    2018 · 176 citations
  • Bridging The Domain Gap For Ground-to-aerial Image Matching
    2019 · 141 citations
  • TCLR: Temporal Contrastive Learning For Video Representation
    2021 · 126 citations
  • Clusternet: Detecting Small Objects In Large Scenes By Exploiting Spatio-temporal Information
    2017 · 102 citations
  • \(r^{2}\)former: Unified \(r\)etrieval And \(r\)eranking Transformer For Place Recognition
    2023 · 101 citations
  • \(r^{2}\)former: Unified \(r\)etrieval And \(r\)eranking Transformer For Place Recognition
    2023 · 101 citations
  • Large-scale Image Geo-localization Using Dominant Sets
    2017 · 30 citations
  • A Culturally-diverse Multilingual Multimodal Video Benchmark & Model
    2025 · 1 citations
  • Agent-x: Evaluating Deep Multimodal Reasoning In Vision-centric Agentic Tasks
    2025
  • Beyond Simple Edits: Composed Video Retrieval With Dense Modifications
    2025
  • Soccerchat: Integrating Multimodal Data For Enhanced Soccer Game Understanding
    2025
Topics
UncategorizedBenchmarksObject DetectionVideo-LanguageTrackingVideo UnderstandingVisual QA & ReasoningEmbodied & AgentsVision-Language ModelsImage-Text Retrieval

Stay Updated

E-Mail Digest

Submit a paper · Privacy · Terms

© 2026 Awesome Papers.