Locating Target Regions for Image Retrieval in an Unsupervised Manner

被引:0
作者
Zhang, Bo-Jian [1 ]
Liu, Guang-Hai [1 ]
Li, Zuo-Yong [2 ]
Song, Shu-Xiang [3 ]
机构
[1] Guangxi Normal Univ, Coll Comp Sci & Engn, Guilin 541004, Peoples R China
[2] Minjiang Univ, Coll Comp & Control Engn, Fuzhou 350121, Peoples R China
[3] Guangxi Normal Univ, Coll Elect Engn, Guilin 541004, Peoples R China
基金
中国国家自然科学基金;
关键词
Attention accumulation; feature aggregation; image retrieval; twice-PCA-whitening (TPW); FEATURES;
D O I
10.1109/TNNLS.2024.3363163
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image retrieval performance can be improved by training a convolutional neural network (CNN) model with annotated data to facilitate accurate localization of target regions. However, obtaining sufficiently annotated data is expensive and impractical in real settings. It is challenging to achieve accurate localization of target regions in an unsupervised manner. To address this problem, we propose a new unsupervised image retrieval method named unsupervised target region localization (UTRL) descriptors. It can precisely locate target regions without supervisory information or learning. Our method contains three highlights: 1) we propose a novel zero-label transfer learning method to address the problem of co-localization in target regions. This enhances the potential localization ability of pretrained CNN models through a zero-label data-driven approach; 2) we propose a multiscale attention accumulation method to accurately extract distinguishable target features. It distinguishes the importance of features by using local Gaussian weights; and 3) we propose a simple yet effective method to reduce vector dimensionality, named twice-PCA-whitening (TPW), which reduces the performance degradation caused by feature compression. Notably, TPW is a robust and general method that can be widely applied to image retrieval tasks to improve retrieval performance. This work also facilitates the development of image retrieval based on short vector features. Extensive experiments on six popular benchmark datasets demonstrate that our method achieves about 7% greater mean average precision (mAP) compared to existing state-of-the-art unsupervised methods.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 67 条
  • [1] Anand S., 2021, Proc. SPIE, V11843, P166
  • [2] Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/TPAMI.2017.2711011, 10.1109/CVPR.2016.572]
  • [3] Ensemble Deep Manifold Similarity Learning using Hard Proxies
    Aziere, Nicolas
    Todorovic, Sinisa
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7291 - 7299
  • [4] Aggregating Deep Convolutional Features for Image Retrieval
    Babenko, Artem
    Lempitsky, Victor
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1269 - 1277
  • [5] Neural Codes for Image Retrieval
    Babenko, Artem
    Slesarev, Anton
    Chigorin, Alexandr
    Lempitsky, Victor
    [J]. COMPUTER VISION - ECCV 2014, PT I, 2014, 8689 : 584 - 599
  • [6] Unsupervised Adversarial Instance-Level Image Retrieval
    Bai, Cong
    Li, Hongkai
    Zhang, Jinglin
    Huang, Ling
    Zhang, Lu
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2199 - 2207
  • [7] Bingyi Cao, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12365), P726, DOI 10.1007/978-3-030-58565-5_43
  • [8] Efficient Object Embedding for Spliced Image Retrieval
    Chen, Bor-Chun
    Wu, Zuxuan
    Davis, Larry S.
    Lim, Ser-Nam
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14960 - 14970
  • [9] Deep Learning for Instance Retrieval: A Survey
    Chen, Wei
    Liu, Yu
    Wang, Weiping
    Bakker, Erwin M.
    Georgiou, Theodoros
    Fieguth, Paul
    Liu, Li
    Lew, Michael S.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7270 - 7292
  • [10] Total recall: Automatic query expansion with a generative feature model for object retrieval
    Chum, Ondrej
    Philbin, James
    Sivic, Josef
    Isard, Michael
    Zisserman, Andrew
    [J]. 2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 496 - +