End-to-End Learning of Deep Visual Representations for Image Retrieval

Cited by: 352
Authors
Gordo, Albert [1]
Almazan, Jon [1]
Revaud, Jerome [1]
Larlus, Diane [1]
Affiliation
[1] Xerox Res Ctr Europe, Comp Vis Grp, Meylan, France
Keywords
Deep learning; Instance-level retrieval; Visual search; Visual representation
DOI
10.1007/s11263-017-1016-8
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
While deep learning has become a key ingredient in the top performing methods for many computer vision tasks, it has failed so far to bring similar improvements to instance-level image retrieval. In this article, we argue that reasons for the underwhelming results of deep methods on image retrieval are threefold: (1) noisy training data, (2) inappropriate deep architecture, and (3) suboptimal training procedure. We address all three issues. First, we leverage a large-scale but noisy landmark dataset and develop an automatic cleaning method that produces a suitable training set for deep retrieval. Second, we build on the recent R-MAC descriptor, show that it can be interpreted as a deep and differentiable architecture, and present improvements to enhance it. Last, we train this network with a siamese architecture that combines three streams with a triplet loss. At the end of the training process, the proposed architecture produces a global image representation in a single forward pass that is well suited for image retrieval. Extensive experiments show that our approach significantly outperforms previous retrieval approaches, including state-of-the-art methods based on costly local descriptor indexing and spatial verification. On Oxford 5k, Paris 6k and Holidays, we respectively report 94.7, 96.6, and 94.8 mean average precision. Our representations can also be heavily compressed using product quantization with little loss in accuracy.
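The abstract mentions training three streams with a triplet loss. The snippet below is a minimal sketch of a standard margin-based triplet loss on L2-normalized global descriptors, not the authors' implementation. The function names, the squared Euclidean distance, and the margin value are assumptions made for illustration; the abstract does not specify them.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row of x to unit Euclidean length."""
    norms = np.linalg.norm(x, axis=-1, keepdims=True)
    return x / np.maximum(norms, eps)


def triplet_loss(query, positive, negative, margin=0.1):
    """Mean hinge loss over a batch of (query, positive, negative) descriptors.

    Each argument is an array of shape (batch, dim). The margin of 0.1 is a
    hypothetical value, not one taken from the abstract.
    """
    q, p, n = (l2_normalize(v) for v in (query, positive, negative))
    d_pos = np.sum((q - p) ** 2, axis=-1)  # squared distance to the relevant image
    d_neg = np.sum((q - n) ** 2, axis=-1)  # squared distance to the non-relevant image
    # The loss is zero once the negative is farther than the positive by at least `margin`.
    return float(np.mean(np.maximum(0.0, margin + d_pos - d_neg)))
```

In this sketch the loss vanishes when the query is closer to the positive than to the negative by at least the margin, which is the ranking behavior a retrieval descriptor needs.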
Pages: 237-254
Page count: 18
Related papers
50 in total
  • [1] End-to-End Learning of Deep Visual Representations for Image Retrieval
    Albert Gordo
    Jon Almazán
    Jerome Revaud
    Diane Larlus
    International Journal of Computer Vision, 2017, 124 : 237 - 254
  • [2] End-to-end learning of representations for instance-level document image retrieval
    Liu, Li
    Lu, Yue
    Suen, Ching Y.
    APPLIED SOFT COMPUTING, 2023, 136
  • [3] End-to-end Learning for Encrypted Image Retrieval
    Feng, Qihua
    Li, Peiya
    Lu, ZhiXun
    Liu, Guan
    Huang, Feiran
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1839 - 1845
  • [4] An End-to-End Image Retrieval System Based on Gravitational Field Deep Learning
    Zheng, Qinghe
    Yang, Mingqiang
    Zhang, Qingrui
    Zhang, Xinxin
    Yang, Jiajie
    2017 INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS, ELECTRONICS AND CONTROL (ICCSEC), 2017, : 936 - 940
  • [5] An End-to-End Learning Architecture for Efficient Image Encoding and Deep Learning
    Chamain, Lahiru D.
    Qi, Siyu
    Ding, Zhi
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 691 - 695
  • [6] A deep learning network based end-to-end image composition
    Zhu, Xiaoyu
    Wang, Haodi
    Zhang, Zhiyi
    Wu, Xiuping
    Guo, Junqi
    Wu, Hao
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, 101
  • [7] An End-to-End Image Dehazing Method Based on Deep Learning
    Zhang, Yi
    Huang, Hongbing
    Liu, Junyi
    Fan, Chao
    Wang, Yanyan
    Cai, Qing
    Ruan, Yingying
    Gong, Xiaojin
    2018 3RD INTERNATIONAL CONFERENCE ON COMMUNICATION, IMAGE AND SIGNAL PROCESSING, 2019, 1169
  • [8] Tell, Imagine, and Search: End-to-end Learning for Composing Text and Image to Image Retrieval
    Zhang, Feifei
    Xu, Mingliang
    Xu, Changsheng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (02)
  • [9] An End-to-End Robotic Visual Localization Algorithm Based on Deep Learning
    Wang, Hongcheng
    Chen, Niansheng
    Fan, Guangyu
    Yang, Dingyu
    Rao, Lei
    Cheng, Songlin
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [10] An End-to-End Robotic Visual Localization Algorithm Based on Deep Learning
    Chen, Niansheng
    Wang, Hongcheng
    Fan, Guangyu
    Yang, Dingyu
    Rao, Lei
    JOURNAL OF SENSORS, 2023, 2023