MULTI-SCALE GEM POOLING WITH N-PAIR CENTER LOSS FOR FINE-GRAINED IMAGE SEARCH

被引:11
作者
Deng, Youming [1 ]
Lin, Xianming [1 ]
Li, Run [1 ]
Ji, Rongrong [1 ]
机构
[1] Xiamen Univ, Sch Informat Sci & Engn, Dept Cognit Sci, Fujian Key Lab Sensing & Comp Smart City, Xiamen, Fujian, Peoples R China
来源
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) | 2019年
关键词
Deep feature learning; Attention mechanism; Fine-grained image search;
D O I
10.1109/ICME.2019.00176
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Most existing fine-grained image retrieval schemes are built based upon deep feature learning paradigms, which typically leverage the feature maps of the last convolutional layer as features. However, such representation focuses only on the global information of the object, leaving the local details unexploited, which is however crucial to identifying subtle differences for fine-grained retrieval. In this paper, we have discovered that the mid-level feature map roles as local salient regions, which well complements the existing global feature representations. To this end, a multi-layer framework is proposed to integrate both local and global representations with generalized mean (GeM) pooling and attention mechanism, trained with the proposed N-pair Center loss to learn more discriminative features. By doing so, state-of-the-art performance can be achieved without using the hard or negative example minings. In the experiments, our approach outperforms favourably compared to the current state-of-the-art methods on the CUB-200-2011, CARS196 and In-shop Clothes Retrieval datasets.
引用
收藏
页码:1000 / 1005
页数:6
相关论文
共 27 条
[1]  
[Anonymous], 2018, IJCAI
[2]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[3]  
He K., 2016, IEEE C COMPUT VIS PA, DOI [10.1007/978-3-319-46493-0_38, DOI 10.1007/978-3-319-46493-0_38, DOI 10.1109/CVPR.2016.90]
[4]  
Hu J., 2018, ARXIV181200573
[5]  
Hu Jie, 2018, CVPR
[6]  
Jégou H, 2010, PROC CVPR IEEE, P3304, DOI 10.1109/CVPR.2010.5540039
[7]   3D Object Representations for Fine-Grained Categorization [J].
Krause, Jonathan ;
Stark, Michael ;
Deng, Jia ;
Li Fei-Fei .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, :554-561
[8]  
Lin M.-B., 2018, ACM MM
[9]   No Fuss Distance Metric Learning using Proxies [J].
Movshovitz-Attias, Yair ;
Toshev, Alexander ;
Leung, Thomas K. ;
Ioffe, Sergey ;
Singh, Saurabh .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :360-368
[10]   Large-Scale Image Retrieval with Attentive Deep Local Features [J].
Noh, Hyeonwoo ;
Araujo, Andre ;
Sim, Jack ;
Weyand, Tobias ;
Han, Bohyung .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :3476-3485