IRGen: Generative Modeling for Image Retrieval

Authors
Zhang, Yidan [1, 2]
Zhang, Ting [1]
Chen, Dong [3]
Wang, Yujing [3]
Chen, Qi [3]
Xie, Xing [3]
Sun, Hao [3]
Deng, Weiwei [3]
Zhang, Qi [3]
Yang, Fan [3]
Yang, Mao [3]
Liao, Qingmin [5]
Wang, Jingdong [4]
Guo, Baining [3]
Affiliations
[1] Beijing Normal University, Beijing, People's Republic of China
[2] University of Tokyo, Bunkyo City, Japan
[3] Microsoft Corporation, Redmond, WA 98052, USA
[4] Baidu, Beijing, People's Republic of China
[5] Tsinghua University, Shenzhen International Graduate School, Shenzhen, People's Republic of China
Source
Keywords
Image Retrieval; Autoregressive Model; Generative Model; Product Quantization; Deep Quantization; Nearest-Neighbor; Network; End
DOI
10.1007/978-3-031-72633-0_2
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
While generative modeling has become prevalent across numerous research fields, its integration into the realm of image retrieval remains largely unexplored and underjustified. In this paper, we present a novel methodology, reframing image retrieval as a variant of generative modeling and employing a sequence-to-sequence model. This approach is harmoniously aligned with the current trend towards unification in research, presenting a cohesive framework that allows for end-to-end differentiable searching. This, in turn, facilitates superior performance via direct optimization techniques. The development of our model, dubbed IRGen, addresses the critical technical challenge of converting an image into a concise sequence of semantic units, which is pivotal for enabling efficient and effective search. Extensive experiments demonstrate that our model achieves state-of-the-art performance on three widely-used image retrieval benchmarks as well as two million-scale datasets, yielding significant improvement compared to prior competitive retrieval methods. In addition, the notable surge in precision scores facilitated by generative modeling presents the potential to bypass the reranking phase, which is traditionally indispensable in practical retrieval workflows. The code is publicly available at https://github.com/yakt00/IRGen.
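The idea of retrieval as sequence generation can be illustrated with a toy sketch. The abstract does not say how images are tokenized or how the decoder scores tokens, so everything below is an assumption made for illustration: residual quantization stands in for the tokenizer, a distance-based scorer stands in for the learned sequence-to-sequence decoder, and the codebooks, embeddings, file names, and function names are all hypothetical. It is not the authors' implementation.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Vector = Sequence[float]
Identifier = Tuple[int, ...]


def sq_dist(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def residual_quantize(x: Vector, codebooks: List[List[Vector]]) -> Identifier:
    """Encode x as one token per level; each level quantizes the leftover residual."""
    residual = list(x)
    tokens = []
    for codebook in codebooks:
        idx = min(range(len(codebook)), key=lambda i: sq_dist(codebook[i], residual))
        tokens.append(idx)
        residual = [r - c for r, c in zip(residual, codebook[idx])]
    return tuple(tokens)


def make_scorer(query: Vector, codebooks: List[List[Vector]]) -> Callable[[Identifier], Dict[int, float]]:
    """Stand-in for a learned decoder (assumption): scores the next token by negative distance."""
    def score_next(prefix: Identifier) -> Dict[int, float]:
        residual = list(query)
        for level, token in enumerate(prefix):
            residual = [r - c for r, c in zip(residual, codebooks[level][token])]
        return {i: -sq_dist(c, residual) for i, c in enumerate(codebooks[len(prefix)])}
    return score_next


def beam_search(score_next, valid_ids: List[Identifier], beam_size: int) -> List[Identifier]:
    """Generate identifiers token by token, keeping only prefixes of stored images."""
    length = len(valid_ids[0])
    prefixes = {ident[:k] for ident in valid_ids for k in range(1, length + 1)}
    beams = [((), 0.0)]
    for _ in range(length):
        candidates = [
            (prefix + (token,), score + s)
            for prefix, score in beams
            for token, s in score_next(prefix).items()
            if prefix + (token,) in prefixes
        ]
        beams = sorted(candidates, key=lambda c: c[1], reverse=True)[:beam_size]
    return [ident for ident, _ in beams]


def retrieve(query: Vector, index: Dict[Identifier, str],
             codebooks: List[List[Vector]], beam_size: int = 2) -> List[str]:
    """Return stored image names in the order their identifiers are generated."""
    ids = beam_search(make_scorer(query, codebooks), list(index), beam_size)
    return [index[i] for i in ids]


# Hypothetical two-level codebooks (coarse, then fine) for 2-D embeddings.
CODEBOOKS = [
    [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)],
    [(0.0, 0.0), (0.25, 0.0), (0.0, 0.25), (0.25, 0.25)],
]
# Hypothetical database of image embeddings.
DATABASE = {"cat.jpg": (0.9, 0.3), "dog.jpg": (0.1, 1.2), "car.jpg": (1.2, 1.0)}
INDEX = {residual_quantize(v, CODEBOOKS): name for name, v in DATABASE.items()}

print(residual_quantize((0.9, 0.3), CODEBOOKS))  # -> (1, 2)
print(retrieve((0.8, 0.4), INDEX, CODEBOOKS))    # -> ['cat.jpg', 'car.jpg']
```

In this sketch each stored image is represented only by its short token sequence, and the search never compares the query against every database embedding. Restricting the beam to prefixes of stored identifiers guarantees that every generated sequence maps back to a real image.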
Pages: 21-41
Page count: 21