LOCORE: Image Re-ranking With Long-context Sequence Modeling
2025 Β· Zilin Xiao, Pavel Suma, Ayush Sachdeva, et al.
Abstract
We introduce LOCORE, Long-Context Re-ranker, a model that takes as input local descriptors corresponding to an image query and a list of gallery images and outputs similarity scores between the query and each gallery image. This model is used for image retrieval, where typically a first ranking is performed with an efficient similarity measure, and then a shortlist of top-ranked images is re-ranked based on a more fine-grained similarity measure. Compared to existing methods that perform pair-wise similarity estimation with local descriptors or list-wise re-ranking with global descriptors, LOCORE is the first method to perform list-wise re-ranking with local descriptors. To achieve this, we leverage efficient long-context sequence models to effectively capture the dependencies between query and gallery images at the local-descriptor level. During testing, we process long shortlists with a sliding window strategy that is tailored to overcome the context size limitations of sequence mode
Authors
(none)
Tags
Stats
Related papers
- Global-to-local Or Local-to-global? Enhancing Image Retrieval With Efficient Local Search And Effective Global Re-ranking (2025)0.00
- Chain-of-thought Re-ranking For Image Retrieval Tasks (2025)1.81
- Moving Towards Centers: Re-ranking With Attention And Memory For Re-identification (2021)8.09
- Mcot-re: Multi-faceted Chain-of-thought And Re-ranking For Training-free Zero-shot Composed Image Retrieval (2025)0.00
- Contextual Similarity Aggregation With Self-attention For Visual Re-ranking (2021)0.00
- Learning A Deep Listwise Context Model For Ranking Refinement (2018)15.85
- CODER: An Efficient Framework For Improving Retrieval Through Contextual Document Embedding Reranking (2021)7.16
- Visual Re-ranking With Non-visual Side Information (2025)0.00