Sampled Image Tagging And Retrieval Methods On User Generated Content
2016 Β· Karl Ni, Kyle Zaragoza, Charles Foster, et al.
Abstract
Traditional image tagging and retrieval algorithms have limited value as a result of being trained with heavily curated datasets. These limitations are most evident when arbitrary search words are used that do not intersect with training set labels. Weak labels from user generated content (UGC) found in the wild (e.g., Google Photos, FlickR, etc.) have an almost unlimited number of unique words in the metadata tags. Prior work on word embeddings successfully leveraged unstructured text with large vocabularies, and our proposed method seeks to apply similar cost functions to open source imagery. Specifically, we train a deep learning image tagging and retrieval system on large scale, user generated content (UGC) using sampling methods and joint optimization of word embeddings. By using the Yahoo! FlickR Creative Commons (YFCC100M) dataset, such an approach builds robustness to common unstructured data issues that include but are not limited to irrelevant tags, misspellings, multiple lan
Authors
(none)
Tags
Stats
Related papers
- Weakly Supervised Deep Image Hashing Through Tag Embeddings (2018)11.29
- Dynamic Spatial Verification For Large-scale Object-level Image Retrieval (2019)0.00
- Object-centric Open-vocabulary Image-retrieval With Aggregated Features (2023)0.00
- Urbancross: Enhancing Satellite Image-text Retrieval With Cross-domain Adaptation (2024)6.34
- Webly Supervised Joint Embedding For Cross-modal Image-text Retrieval (2018)13.17
- Pic2word: Mapping Pictures To Words For Zero-shot Composed Image Retrieval (2023)20.24
- A Feature-space Multimodal Data Augmentation Technique For Text-video Retrieval (2022)12.43
- Flickr30k-cfq: A Compact And Fragmented Query Dataset For Text-image Retrieval (2024)3.58