Awesome Speech Audio

Fahad Shahbaz Khan

25 papers · 1 citation
Most-cited papers
  • Multi-stage Progressive Image Restoration
    2021 · 1754 citations
  • Discriminative Scale Space Tracking
    2016 · 1165 citations
  • OW-DETR: Open-world Detection Transformer
    2021 · 202 citations
  • GeoChat: Grounded Large Vision-language Model For Remote Sensing
    2023 · 183 citations
  • Fine-tuned CLIP Models Are Efficient Video Learners
    2022 · 162 citations
  • A Generative Appearance Model For End-to-end Video Object Segmentation
    2018 · 157 citations
  • Composed Video Retrieval Via Enriched Context And Discriminative Embeddings
    2024 · 13 citations
  • A Culturally-diverse Multilingual Multimodal Video Benchmark & Model
    2025 · 1 citation
  • VideoMolmo: Spatio-temporal Grounding Meets Pointing
    2025
  • LawDIS: Language-window-based Controllable Dichotomous Image Segmentation
    2025
  • TerraFM: A Scalable Foundation Model For Unified Multisensor Earth Observation
    2025
  • RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark Towards General Grasping
    2025
  • Beyond Simple Edits: Composed Video Retrieval With Dense Modifications
    2025
  • Composed Object Retrieval: Object-level Retrieval Via Composed Expressions
    2025
  • CoME-VL: Scaling Complementary Multi-encoder Vision-language Learning
    2026
Topics
Vision-Language Models · Uncategorized · Image Retrieval · Video-Language · Benchmarks · Object Detection · Image Restoration · Image Generation · Tracking · Visual Language


© 2026 Awesome Papers.