Mohamed Elhoseiny
10 papers Β· 0 citations
Most-cited papers
- Reltransformer: A Transformer-based Long-tail Visual Relationship Recognition2021 Β· 19 citations
- Vrsbench: A Versatile Vision-language Benchmark Dataset For Remote Sensing Image Understanding2024 Β· 15 citations
- Exploring Hierarchical Graph Representation For Large-scale Zero-shot Image Classification2022 Β· 10 citations
- Goldfish: Vision-language Understanding Of Arbitrarily Long Videos2024 Β· 6 citations
- Imagecaptioner\(^2\): Image Captioner For Image Captioning Bias Amplification Assessment2023 Β· 6 citations
- Category-level Text-to-image Retrieval Improved: Bridging The Domain Gap With Diffusion Models And Vision Encoders2025
- MAGNET: A Multi-agent Framework For Finding Audio-visual Needles By Reasoning Over Multi-video Haystacks2025
- A Survey On Long-video Storytelling Generation: Architectures, Consistency, And Cinematic Quality2025
- Reefnet: A Large-scale Dataset And Benchmark For Fine-grained Coral Reef Recognition2025
- Fishnet++: Analyzing The Capabilities Of Multimodal Large Language Models In Marine Biology2025
Topics