Fahad Shahbaz Khan

25 papers · 1 citation

Most-cited papers

Multi-stage Progressive Image Restoration
2021 · 1754 citations
Discriminative Scale Space Tracking
2016 · 1165 citations
OW-DETR: Open-world Detection Transformer
2021 · 202 citations
GeoChat: Grounded Large Vision-language Model For Remote Sensing
2023 · 183 citations
Fine-tuned CLIP Models Are Efficient Video Learners
2022 · 162 citations
A Generative Appearance Model For End-to-end Video Object Segmentation
2018 · 157 citations
Composed Video Retrieval Via Enriched Context And Discriminative Embeddings
2024 · 13 citations
A Culturally-diverse Multilingual Multimodal Video Benchmark & Model
2025 · 1 citation
VideoMolmo: Spatio-temporal Grounding Meets Pointing
2025
LawDIS: Language-window-based Controllable Dichotomous Image Segmentation
2025
TerraFM: A Scalable Foundation Model For Unified Multisensor Earth Observation
2025
RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark Towards General Grasping
2025
Beyond Simple Edits: Composed Video Retrieval With Dense Modifications
2025
Composed Object Retrieval: Object-level Retrieval Via Composed Expressions
2025
Come-vl: Scaling Complementary Multi-encoder Vision-language Learning
2026

Topics

Vision-Language Models Uncategorized Image Retrieval Video-Language Benchmarks Object Detection Image Restoration Image Generation Tracking Visual Language