A Feature Analysis For Multimodal News Retrieval
2020 · Golsa Tahmasebzadeh, Sherzod Hakimov, Eric Müller-Budack, et al.
Abstract
Content-based information retrieval is based on the information contained in documents rather than using metadata such as keywords. Most information retrieval methods are either based on text or image. In this paper, we investigate the usefulness of multimodal features for cross-lingual news search in various domains: politics, health, environment, sport, and finance. To this end, we consider five feature types for image and text and compare the performance of the retrieval system using different combinations. Experimental results show that retrieval results can be improved when considering both visual and textual information. In addition, it is observed that among textual features entity overlap outperforms word embeddings, while geolocation embeddings achieve better performance among visual features in the retrieval task.
Authors
(none)
Tags
Stats
Related papers
- Revisiting Cross Modal Retrieval (2018)0.00
- New Ideas And Trends In Deep Multimodal Content Understanding: A Review (2020)12.10
- Mm-embed: Universal Multimodal Retrieval With Multimodal Llms (2024)0.00
- Machine Learning Methods For Multimedia Information Retrieval (2017)0.00
- Cross-modal Retrieval: A Systematic Review Of Methods And Future Directions (2023)12.81
- Multimodal Representation Alignment For Cross-modal Information Retrieval (2025)0.00
- Image Search Using Multilingual Texts: A Cross-modal Learning Approach Between Image And Text (2019)0.00
- Multimodal Semantic Retrieval For Product Search (2025)3.58