Deep Learning Techniques For Future Intelligent Cross-media Retrieval
2020 Β· Sadaqat Ur Rehman, Muhammad Waqas, Shanshan Tu, et al.
Abstract
With the advancement in technology and the expansion of broadcasting, cross-media retrieval has gained much attention. It plays a significant role in big data applications and consists in searching and finding data from different types of media. In this paper, we provide a novel taxonomy according to the challenges faced by multi-modal deep learning approaches in solving cross-media retrieval, namely: representation, alignment, and translation. These challenges are evaluated on deep learning (DL) based methods, which are categorized into four main groups: 1) unsupervised methods, 2) supervised methods, 3) pairwise based methods, and 4) rank based methods. Then, we present some well-known cross-media datasets used for retrieval, considering the importance of these datasets in the context in of deep learning based cross-media retrieval approaches. Moreover, we also present an extensive review of the state-of-the-art problems and its corresponding solutions for encouraging deep learning i
Authors
(none)
Tags
Stats
Related papers
- Cross-modal Retrieval: A Systematic Review Of Methods And Future Directions (2023)12.81
- A Multimodal Deep Learning Framework For Scalable Content Based Visual Media Retrieval (2021)0.00
- Cross-media Scientific Research Achievements Retrieval Based On Deep Language Model (2022)0.00
- Scientific And Technological Information Oriented Semantics-adversarial And Media-adversarial Cross-media Retrieval (2022)0.00
- A New Benchmark And Approach For Fine-grained Cross-media Retrieval (2019)17.33
- Deep Learning For Instance Retrieval: A Survey (2021)16.05
- Cross-view Image Retrieval -- Ground To Aerial Image Retrieval Through Deep Learning (2020)5.24
- Learning Joint Embedding For Cross-modal Retrieval (2019)5.84