Adaptive multi-feature fusion via cross-entropy normalization for effective image retrieval

被引:15
作者
Ma, Wentao [1 ]
Zhou, Tongqing [1 ]
Qin, Jiaohua [2 ]
Xiang, Xuyu [2 ]
Tan, Yun [2 ]
Cai, Zhiping [1 ]
机构
[1] Natl Univ Def Technol, Coll Comp, Changsha 410073, Hunan, Peoples R China
[2] Cent South Univ Forestry & Technol, Coll Comp Sci & Informat Technol, Changsha 410000, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Image retrieval; Cross-entropy; Feature fusion; High-level semantic features; SELECTIVE RANK FUSION; GRAPH; CLASSIFICATION;
D O I
10.1016/j.ipm.2022.103119
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-feature fusion has achieved gratifying performance in image retrieval. However, some existing fusion mechanisms would unfortunately make the result worse than expected due to the domain and visual diversity of images. As a result, a burning problem for applying feature fusion mechanism is how to figure out and improve the complementarity of multi-level heterogeneous features. To this end, this paper proposes an adaptive multi-feature fusion method via cross-entropy normalization for effective image retrieval. First, various low-level features (e.g., SIFT) and high-level semantic features based on deep learning are extracted. Under each level of feature representation, the initial similarity scores of the query image w.r.t. the target dataset are calculated. Second, we use an independent reference dataset to approximate the tail of the attained initial similarity score ranking curve by cross-entropy normalization. Then the area under the ranking curve is calculated as the indicator of the merit of corresponding feature (i.e., a smaller area indicates a more suitable feature.). Finally, fusion weights of each feature are assigned adaptively by the statistically elaborated areas. Extensive experiments on three public benchmark datasets have demonstrated that the proposed method can achieve superior performance compared with the existing methods, improving the metrics mAP by relatively 1.04% (for Holidays), 1.22% (for Oxf5k) and the N-S by relatively 0.04 (for UKbench), respectively.
引用
收藏
页数:17
相关论文
共 59 条
  • [1] Deep learning-based sentiment classification of evaluative text based on Multi-feature fusion
    Abdi, Asad
    Shamsuddin, Siti Mariyam
    Hasan, Shafaatunnur
    Piran, Jalil
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (04) : 1245 - 1259
  • [2] Large-scale instance-level image retrieval
    Amato, Giuseppe
    Carrara, Fabio
    Falchi, Fabrizio
    Gennaro, Claudio
    Vadicamo, Lucia
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (06)
  • [3] Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/TPAMI.2017.2711011, 10.1109/CVPR.2016.572]
  • [4] The Inverted Multi-Index
    Babenko, Artem
    Lempitsky, Victor
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (06) : 1247 - 1260
  • [5] Neural Codes for Image Retrieval
    Babenko, Artem
    Slesarev, Anton
    Chigorin, Alexandr
    Lempitsky, Victor
    [J]. COMPUTER VISION - ECCV 2014, PT I, 2014, 8689 : 584 - 599
  • [6] Object-Based Aggregation of Deep Features for Image Retrieval
    Bao, Yu
    Li, Haojie
    [J]. MULTIMEDIA MODELING (MMM 2017), PT I, 2017, 10132 : 478 - 489
  • [7] Bhowmik N, 2014, IEEE IMAGE PROC, P5766, DOI 10.1109/ICIP.2014.7026166
  • [8] Chen X, 2009, LECT NOTES ARTIF INT, V5476, P867, DOI 10.1007/978-3-642-01307-2_90
  • [9] SCRATCH: A Scalable Discrete Matrix Factorization Hashing Framework for Cross-Modal Retrieval
    Chen, Zhen-Duo
    Li, Chuan-Xiang
    Luo, Xin
    Nie, Liqiang
    Zhang, Wei
    Xu, Xin-Shun
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (07) : 2262 - 2275
  • [10] Chen ZQ, 2018, IEEE IMAGE PROC, P1982, DOI 10.1109/ICIP.2018.8451486