A Multimodal Deep Learning Framework For Scalable Content Based Visual Media Retrieval
2021 Β· Ambareesh Ravi, Amith Nandakumar
Abstract
We propose a novel, efficient, modular and scalable framework for content based visual media retrieval systems by leveraging the power of Deep Learning which is flexible to work both for images and videos conjointly and we also introduce an efficient comparison and filtering metric for retrieval. We put forward our findings from critical performance tests comparing our method to the predominant conventional approach to demonstrate the feasibility and efficiency of the proposed solution with best practices, possible improvements that may further augment the ability of retrieval architectures.
Authors
(none)
Tags
Stats
Related papers
- Compressible And Searchable: Ai-native Multi-modal Retrieval System With Learned Image Compression (2024)0.00
- Deep Learning Techniques For Future Intelligent Cross-media Retrieval (2020)0.00
- Verve: Versatile Retrieval For Videos Via Unified Embeddings (2026)0.00
- A Fast Content-based Image Retrieval Method Using Deep Visual Features (2019)6.77
- Deep Semantic Multimodal Hashing Network For Scalable Image-text And Video-text Retrievals (2019)14.43
- Multimodal Lengthy Videos Retrieval Framework And Evaluation Metric (2025)0.00
- Deepimagesearch: Benchmarking Multimodal Agents For Context-aware Image Retrieval In Visual Histories (2026)0.00
- Embedding-based Retrieval In Multimodal Content Moderation (2025)2.26