Transformer-based Clipped Contrastive Quantization Learning For Unsupervised Image Retrieval
2024 Β· Ayush Dubey, Shiv Ram Dubey, Satish Kumar Singh, et al.
Abstract
Unsupervised image retrieval aims to learn the important visual characteristics without any given level to retrieve the similar images for a given query image. The Convolutional Neural Network (CNN)-based approaches have been extensively exploited with self-supervised contrastive learning for image hashing. However, the existing approaches suffer due to lack of effective utilization of global features by CNNs and biased-ness created by false negative pairs in the contrastive learning. In this paper, we propose a TransClippedCLR model by encoding the global context of an image using Transformer having local context through patch based processing, by generating the hash codes through product quantization and by avoiding the potential false negative pairs through clipped contrastive learning. The proposed model is tested with superior performance for unsupervised image retrieval on benchmark datasets, including CIFAR10, NUS-Wide and Flickr25K, as compared to the recent state-of-the-art de
Authors
(none)
Tags
Stats
Related papers
- Self-supervised Consistent Quantization For Fully Unsupervised Image Retrieval (2022)0.00
- Convolutional Patch Representations For Image Retrieval: An Unsupervised Approach (2016)12.47
- Unsupervised Triplet Hashing For Fast Image Retrieval (2017)12.10
- Self-supervised Product Quantization For Deep Unsupervised Image Retrieval (2021)13.44
- Unsupervised Dense Retrieval With Conterfactual Contrastive Learning (2024)0.00
- Enhancing Image Retrieval : A Comprehensive Study On Photo Search Using The CLIP Mode (2024)0.00
- Evaluating Contrastive Models For Instance-based Image Retrieval (2021)5.24
- Optimizing CLIP Models For Image Retrieval With Maintained Joint-embedding Alignment (2024)6.34