Deep Aggregation of Regional Convolutional Activations for Content Based Image Retrieval

被引:4
作者
Schall, Konstantin [1 ]
Barthel, Kai Uwe [1 ]
Hezel, Nico [1 ]
Jung, Klaus [1 ]
机构
[1] HTW Berlin, Visual Comp Grp, Berlin, Germany
来源
2019 IEEE 21ST INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP 2019) | 2019年
关键词
Computational and artificial intelligence; Multi-layer neural network; Image retrieval; Content-based retrieval; Machine learning; Feature extraction; Machine learning algorithms; Nearest neighbor searches; Computer vision;
D O I
10.1109/mmsp.2019.8901787
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
One of the key challenges of deep learning based image retrieval remains in aggregating convolutional activations into one highly representative feature vector. Ideally, this descriptor should encode semantic, spatial and low level information. Even though off-the-shelf pre-trained neural networks can already produce good representations in combination with aggregation methods, appropriate fine tuning for the task of image retrieval has shown to significantly boost retrieval performance. In this paper we present a simple yet effective supervised aggregation method built on top of existing regional pooling approaches. In addition to the maximum activation of a given region, we calculate regional average activations of extracted feature maps. Subsequently, weights for each of the pooled feature vectors are learned to perform a weighted aggregation to a single feature vector. Furthermore, we apply our newly proposed NRA loss function for deep metric learning to fine tune the backbone neural network and to learn the aggregation weights. Our method achieves state-of-the-art results for the INRIA Holidays data set and competitive results for the Oxford Buildings and Paris data sets while reducing the training time significantly.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] DEEP FEATURE FACTORIZATION FOR CONTENT-BASED IMAGE RETRIEVAL AND LOCALIZATION
    Collins, Edo
    Susstrunk, Sabine
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 874 - 878
  • [32] Deep Learning for Plant Classification and Content-Based Image Retrieval
    Gyires-Toth, Balint Pal
    Osvath, Marton
    Papp, David
    Szucs, Gabor
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2019, 19 (01) : 88 - 100
  • [33] Rethinking the Rotation Invariance of Local Convolutional Features for Content-Based Image Retrieval
    Zhao, Longjiao
    Wang, Yu
    Kato, Jien
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (01) : 174 - 182
  • [34] Performance Analysis of Content Based Image Retrieval Systems
    Gupta, Arko
    Agarwal, Dinesh
    Veenu
    Bhatia, M. P. S.
    2018 INTERNATIONAL CONFERENCE ON COMPUTING, POWER AND COMMUNICATION TECHNOLOGIES (GUCON), 2018, : 899 - 902
  • [35] Content-based Image Retrieval System via Deep Learning Method
    Tian, Xinyu
    Zheng, Qinghe
    Xing, Jianping
    PROCEEDINGS OF 2018 IEEE 3RD ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC 2018), 2018, : 1257 - 1261
  • [36] Relevance Feedback for Content-Based Image Retrieval Using Deep Learning
    Xu, Heng
    Wang, Jun-yi
    Mao, Lei
    2017 2ND INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2017), 2017, : 629 - 633
  • [37] Integrated color, texture and shape information for content-based image retrieval
    Ryszard S. Choraś
    Tomasz Andrysiak
    Michał Choraś
    Pattern Analysis and Applications, 2007, 10 : 333 - 343
  • [38] Integrated color, texture and shape information for content-based image retrieval
    Choras, Ryszard S.
    Andrysiak, Tomasz
    Choras, Michal
    PATTERN ANALYSIS AND APPLICATIONS, 2007, 10 (04) : 333 - 343
  • [39] Content-Based Image Retrieval
    Zaheer, Yasir
    SECOND INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING, 2010, 7546
  • [40] Content Based Image and Video Retrieval
    Patil, Shubhangi H.
    Belegali, P. P.
    Patil, B. S.
    Mohite, T. H.
    Dhanashri, Dhobale D.
    SECOND INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING, 2010, 7546