Adaptive multi-feature fusion via cross-entropy normalization for effective image retrieval

被引：15

作者：

Ma, Wentao ^{[1
]}

Zhou, Tongqing ^{[1
]}

Qin, Jiaohua ^{[2
]}

Xiang, Xuyu ^{[2
]}

Tan, Yun ^{[2
]}

Cai, Zhiping ^{[1
]}

机构：

[1] Natl Univ Def Technol, Coll Comp, Changsha 410073, Hunan, Peoples R China

[2] Cent South Univ Forestry & Technol, Coll Comp Sci & Informat Technol, Changsha 410000, Hunan, Peoples R China

来源：

INFORMATION PROCESSING & MANAGEMENT | 2023年 / 60卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Image retrieval; Cross-entropy; Feature fusion; High-level semantic features; SELECTIVE RANK FUSION; GRAPH; CLASSIFICATION;

D O I：

10.1016/j.ipm.2022.103119

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Multi-feature fusion has achieved gratifying performance in image retrieval. However, some existing fusion mechanisms would unfortunately make the result worse than expected due to the domain and visual diversity of images. As a result, a burning problem for applying feature fusion mechanism is how to figure out and improve the complementarity of multi-level heterogeneous features. To this end, this paper proposes an adaptive multi-feature fusion method via cross-entropy normalization for effective image retrieval. First, various low-level features (e.g., SIFT) and high-level semantic features based on deep learning are extracted. Under each level of feature representation, the initial similarity scores of the query image w.r.t. the target dataset are calculated. Second, we use an independent reference dataset to approximate the tail of the attained initial similarity score ranking curve by cross-entropy normalization. Then the area under the ranking curve is calculated as the indicator of the merit of corresponding feature (i.e., a smaller area indicates a more suitable feature.). Finally, fusion weights of each feature are assigned adaptively by the statistically elaborated areas. Extensive experiments on three public benchmark datasets have demonstrated that the proposed method can achieve superior performance compared with the existing methods, improving the metrics mAP by relatively 1.04% (for Holidays), 1.22% (for Oxf5k) and the N-S by relatively 0.04 (for UKbench), respectively.

引用

页数：17

共 59 条

[1] Deep learning-based sentiment classification of evaluative text based on Multi-feature fusion
Abdi, Asad
Shamsuddin, Siti Mariyam
Hasan, Shafaatunnur
Piran, Jalil
[J]. INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (04) : 1245 - 1259
[2] Large-scale instance-level image retrieval
Amato, Giuseppe
Carrara, Fabio
Falchi, Fabrizio
Gennaro, Claudio
Vadicamo, Lucia
[J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (06)
[3] Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/TPAMI.2017.2711011, 10.1109/CVPR.2016.572]
[4] The Inverted Multi-Index
Babenko, Artem
Lempitsky, Victor
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (06) : 1247 - 1260
[5] Neural Codes for Image Retrieval
Babenko, Artem
Slesarev, Anton
Chigorin, Alexandr
Lempitsky, Victor
[J]. COMPUTER VISION - ECCV 2014, PT I, 2014, 8689 : 584 - 599
[6] Object-Based Aggregation of Deep Features for Image Retrieval
Bao, Yu
Li, Haojie
[J]. MULTIMEDIA MODELING (MMM 2017), PT I, 2017, 10132 : 478 - 489
[7] Bhowmik N, 2014, IEEE IMAGE PROC, P5766, DOI 10.1109/ICIP.2014.7026166
[8] Chen X, 2009, LECT NOTES ARTIF INT, V5476, P867, DOI 10.1007/978-3-642-01307-2_90
[9] SCRATCH: A Scalable Discrete Matrix Factorization Hashing Framework for Cross-Modal Retrieval
Chen, Zhen-Duo
Li, Chuan-Xiang
Luo, Xin
Nie, Liqiang
Zhang, Wei
Xu, Xin-Shun
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (07) : 2262 - 2275
[10] Chen ZQ, 2018, IEEE IMAGE PROC, P1982, DOI 10.1109/ICIP.2018.8451486

← 1 2 3 4 5 6 →