Multi-spectral Remote Sensing Image Retrieval Using Geospatial Foundation Models
2024 Β· Benedikt Blumenstiel, Viktoria Moor, Romeo Kienzler, et al.
Abstract
Image retrieval enables an efficient search through vast amounts of satellite imagery and returns similar images to a query. Deep learning models can identify images across various semantic concepts without the need for annotations. This work proposes to use Geospatial Foundation Models, like Prithvi, for remote sensing image retrieval with multiple benefits: i) the models encode multi-spectral satellite data and ii) generalize without further fine-tuning. We introduce two datasets to the retrieval task and observe a strong performance: Prithvi processes six bands and achieves a mean Average Precision of 97.62% on BigEarthNet-43 and 44.51% on ForestNet-12, outperforming other RGB-based models. Further, we evaluate three compression methods with binarized embeddings balancing retrieval speed and accuracy. They match the retrieval speed of much shorter hash codes while maintaining the same accuracy as floating-point embeddings but with a 32-fold compression. The code is available at http
Authors
(none)
Tags
Stats
Related papers
- Exploiting Deep Features For Remote Sensing Image Retrieval: A Systematic Investigation (2017)14.47
- CMIR-NET : A Deep Learning Based Model For Cross-modal Retrieval In Remote Sensing (2019)13.34
- Aggregated Deep Local Features For Remote Sensing Image Retrieval (2019)14.11
- Composed Image Retrieval For Remote Sensing (2024)11.03
- Patternnet: A Benchmark Dataset For Performance Evaluation Of Remote Sensing Image Retrieval (2017)19.98
- A Novel Graph-theoretic Deep Representation Learning Method For Multi-label Remote Sensing Image Retrieval (2021)8.82
- Fast-then-fine: A Two-stage Framework With Multi-granular Representation For Cross-modal Retrieval In Remote Sensing (2026)0.00
- Img2loc: Revisiting Image Geolocalization Using Multi-modality Foundation Models And Image-based Retrieval-augmented Generation (2024)9.23