Multi-level image representation for large-scale image-based instance retrieval

被引：16

作者：

Deng, Qili ^{[1
]}

Wu, Shuai ^{[1
]}

Wen, Jie ^{[1
]}

Xu, Yong ^{[1
]}

机构：

[1] Harbin Inst Technol, Shenzhen Grad Sch, Biocomp Res Ctr, Shenzhen 518055, Peoples R China

来源：

CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY | 2018年 / 3卷 / 01期

关键词：

D O I：

10.1049/trit.2018.0003

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, instance-level-image retrieval has attracted massive attention. Several researchers proposed that the representations learned by convolutional neural network (CNN) can be used for image retrieval task. In this study, the authors propose an effective feature encoder to extract robust information from CNN. It consists of two main steps: the embedding step and the aggregation step. Moreover, they apply the multi-task loss function to train their model in order to make the training process more effective. Finally, this study proposes a novel representation policy that encodes feature vectors extracted from different layers to capture both local patterns and semantic concepts from deep CNN. They call this `multi-level- image representation', which could further improve the performance. The proposed model is helpful to improve the retrieval performance. For the sake of comprehensively evaluating the performance of their approach, they conducted ablation experiments with various convolutional NN architectures. Furthermore, they apply their approach to a concrete challenge - Alibaba large-scale search challenge. The results show that their model is effective and competitive.

引用

页码：33 / 39

页数：7

共 29 条

[1] Andrew Zisserman, 2015, Arxiv, DOI arXiv:1409.1556
[2] Azizpour H., 2014, ARXIV14065774
[3] Neural Codes for Image Retrieval
Babenko, Artem
Slesarev, Anton
Chigorin, Alexandr
Lempitsky, Victor
[J]. COMPUTER VISION - ECCV 2014, PT I, 2014, 8689 : 584 - 599
[4] Bay H., 2006, LNCS, V3951, P404, DOI [10.1007/1174402332, DOI 10.1007/11744023_32]
[5] Deng QL, 2015, 2015 INTERNATIONAL CONFERENCE ON COMPUTERS, COMMUNICATIONS, AND SYSTEMS (ICCCS), P206, DOI 10.1109/CCOMS.2015.7562902
[6] Girshick R., 2014, P IEEE C COMP VIS PA, DOI DOI 10.1109/CVPR.2014.81
[7] Deep Image Retrieval: Learning Global Representations for Image Search
Gordo, Albert
Almazan, Jon
Revaud, Jerome
Larlus, Diane
[J]. COMPUTER VISION - ECCV 2016, PT VI, 2016, 9910 : 241 - 257
[8] Gordo A, 2012, PROC CVPR IEEE, P3045, DOI 10.1109/CVPR.2012.6248035
[9] Guo K, 2017, CAAI T INTELL TECHNO, V2, P39, DOI 10.1016/j.trit.2017.03.001
[10] Cross-domain Image Retrieval with a Dual Attribute-aware Ranking Network
Huang, Junshi
Feris, Rogerio
Chen, Qiang
Yan, Shuicheng
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1062 - 1070

← 1 2 3 →