IRGen: Generative Modeling For Image Retrieval
2023 · Yidan Zhang, Ting Zhang, Dong Chen, et al.
Abstract
While generative modeling has become prevalent across numerous research fields, its integration into the realm of image retrieval remains largely unexplored and underjustified. In this paper, we present a novel methodology, reframing image retrieval as a variant of generative modeling and employing a sequence-to-sequence model. This approach is harmoniously aligned with the current trend towards unification in research, presenting a cohesive framework that allows for end-to-end differentiable searching. This, in turn, facilitates superior performance via direct optimization techniques. The development of our model, dubbed IRGen, addresses the critical technical challenge of converting an image into a concise sequence of semantic units, which is pivotal for enabling efficient and effective search. Extensive experiments demonstrate that our model achieves state-of-the-art performance on three widely-used image retrieval benchmarks as well as two million-scale datasets, yielding significa
Related papers
- GenIR: Generative Visual Feedback For Mental Image Retrieval (2025) 0.00
- Scalable And Effective Generative Information Retrieval (2023) 10.48
- Generative Retrieval As Dense Retrieval (2023) 0.00
- ImageRAG: Dynamic Image Retrieval For Reference-guided Image Generation (2025) 0.00
- Binary Generative Adversarial Networks For Image Retrieval (2017) 18.34
- Generative Adversarial Nets For Information Retrieval: Fundamentals And Advances (2018) 0.00
- Content-based Search For Deep Generative Models (2022) 6.34
- Generative Retrieval As Multi-vector Dense Retrieval (2024) 8.60