Benchmarking Unsupervised Near-duplicate Image Detection
2019 Β· Lia Morra, Fabrizio Lamberti
Abstract
Unsupervised near-duplicate detection has many practical applications ranging from social media analysis and web-scale retrieval, to digital image forensics. It entails running a threshold-limited query on a set of descriptors extracted from the images, with the goal of identifying all possible near-duplicates, while limiting the false positives due to visually similar images. Since the rate of false alarms grows with the dataset size, a very high specificity is thus required, up to \(1 - 10^\{-9\}\) for realistic use cases; this important requirement, however, is often overlooked in literature. In recent years, descriptors based on deep convolutional neural networks have matched or surpassed traditional feature extraction methods in content-based image retrieval tasks. To the best of our knowledge, ours is the first attempt to establish the performance range of deep learning-based descriptors for unsupervised near-duplicate detection on a range of datasets, encompassing a broad spectr
Authors
(none)
Tags
Stats
Related papers
- Benchmarking Pretrained Vision Embeddings For Near- And Duplicate Detection In Medical Images (2023)7.16
- Dataset And Case Studies For Visual Near-duplicates Detection In The Context Of Social Media (2022)0.00
- CNN Retrieval Based Unsupervised Metric Learning For Near-duplicated Video Retrieval (2021)0.00
- Beyond Supervised Vs. Unsupervised: Representative Benchmarking And Analysis Of Image Representation Learning (2022)8.35
- Unsupervised Deep Features For Remote Sensing Image Matching Via Discriminator Network (2018)8.09
- Unsupervised Multi-criteria Adversarial Detection In Deep Image Retrieval (2023)0.00
- Unsupervised Feature Learning Via Non-parametric Instance-level Discrimination (2018)25.66
- Object Detection Based Deep Unsupervised Hashing (2018)6.34