Google Landmarks Dataset V2 -- A Large-scale Benchmark For Instance-level Recognition And Retrieval
2020 Β· Tobias Weyand, Andre Araujo, Bingyi Cao, et al.
Abstract
While image retrieval and instance recognition techniques are progressing rapidly, there is a need for challenging datasets to accurately measure their performance -- while posing novel challenges that are relevant for practical applications. We introduce the Google Landmarks Dataset v2 (GLDv2), a new benchmark for large-scale, fine-grained instance recognition and image retrieval in the domain of human-made and natural landmarks. GLDv2 is the largest such dataset to date by a large margin, including over 5M images and 200k distinct instance labels. Its test set consists of 118k images with ground truth annotations for both the retrieval and recognition tasks. The ground truth construction involved over 800 hours of human annotator work. Our new dataset has several challenging properties inspired by real world applications that previous datasets did not consider: An extremely long-tailed class distribution, a large fraction of out-of-domain test photos and large intra-class variability
Authors
(none)
Tags
Stats
Related papers
- Large-scale Landmark Retrieval/recognition Under A Noisy And Diverse Dataset (2019)0.00
- Semi-supervised Exploration In Image Retrieval (2019)0.00
- A Benchmark On Tricks For Large-scale Image Retrieval (2019)0.00
- Large-scale Image Retrieval With Attentive Deep Local Features (2016)30.63
- 3rd Place Solution To "google Landmark Retrieval 2020" (2020)0.00
- Detect-to-retrieve: Efficient Regional Aggregation For Image Search (2018)24.71
- Two-stage Discriminative Re-ranking For Large-scale Landmark Retrieval (2020)15.20
- Efficient Large-scale Image Retrieval With Deep Feature Orthogonality And Hybrid-swin-transformers (2021)0.00