Aggregating Deep Convolutional Features for Image Retrieval

被引:632
作者
Babenko, Artem [1 ]
Lempitsky, Victor [2 ]
机构
[1] Yandex, Moscow Inst Phys & Technol, Moscow, Russia
[2] Skolkovo Inst Sci & Technol Skoltech, Moscow, Russia
来源
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2015年
关键词
D O I
10.1109/ICCV.2015.150
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Several recent works have shown that image descriptors produced by deep convolutional neural networks provide state-of-the-art performance for image classification and retrieval problems. It has also been shown that the activations from the convolutional layers can be interpreted as local features describing particular image regions. These local features can be aggregated using aggregation approaches developed for local features (e. g. Fisher vectors), thus providing new powerful global descriptors. In this paper we investigate possible ways to aggregate local deep features to produce compact global descriptors for image retrieval. First, we show that deep features and traditional hand-engineered features have quite different distributions of pairwise similarities, hence existing aggregation methods have to be carefully re-evaluated. Such re-evaluation reveals that in contrast to shallow features, the simple aggregation method based on sum pooling provides arguably the best performance for deep convolutional features. This method is efficient, has few parameters, and bears little risk of overfitting when e. g. learning the PCA matrix. Overall, the new compact global descriptor improves the state-of-the-art on four common benchmarks considerably.
引用
收藏
页码:1269 / 1277
页数:9
相关论文
共 26 条
[1]  
[Anonymous], 2009, Advances in neural information processing systems
[2]  
[Anonymous], 2015, PROC INT C LEARN REP
[3]  
[Anonymous], 1989, P ADV NEURAL INFORM
[4]  
[Anonymous], EUR C COMP VIS
[5]  
[Anonymous], IEEE COMP SOC C COMP
[6]  
[Anonymous], ABS14126574 CORR
[7]  
[Anonymous], 2014, P BRIT MACH VIS C 20
[8]  
[Anonymous], ABS14065774 CORR
[9]  
[Anonymous], 2006, IEEE C COMP VIS PATT
[10]  
[Anonymous], P IEEE C COMP VIS PA