Local Pyramidal Descriptors for Image Recognition

被引:39
作者
Seidenari, Lorenzo [1 ]
Serra, Giuseppe [2 ]
Bagdanov, Andrew D. [1 ]
Del Bimbo, Alberto [1 ]
机构
[1] Univ Florence, Media Integrat & Commun Ctr, I-50139 Florence, Italy
[2] Univ Modena & Reggio Emilia, I-41100 Modena, Italy
关键词
Object categorization; local features; kernel methods;
D O I
10.1109/TPAMI.2013.232
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a novel method to improve the flexibility of descriptor matching for image recognition by using local multiresolution pyramids in feature space. We propose that image patches be represented at multiple levels of descriptor detail and that these levels be defined in terms of local spatial pooling resolution. Preserving multiple levels of detail in local descriptors is a way of hedging one's bets on which levels will most relevant for matching during learning and recognition. We introduce the Pyramid SIFT (P-SIFT) descriptor and show that its use in four state-of-the-art image recognition pipelines improves accuracy and yields state-of-the-art results. Our technique is applicable independently of spatial pyramid matching and we show that spatial pyramids can be combined with local pyramids to obtain further improvement. We achieve state-of-the-art results on Caltech-101 (80.1%) and Caltech-256 (52.6%) when compared to other approaches based on SIFT features over intensity images. Our technique is efficient and is extremely easy to integrate into image recognition pipelines.
引用
收藏
页码:1033 / 1040
页数:8
相关论文
共 30 条
[1]  
[Anonymous], P ICCV
[2]  
[Anonymous], P ICCV
[3]  
[Anonymous], 2006, IEEE COMP SOC C COMP
[4]  
[Anonymous], 2010, P CVPR
[5]  
[Anonymous], 2005, P CVPR
[6]  
[Anonymous], 2005, P CVPR
[7]  
[Anonymous], 2009, P CVPR
[8]  
[Anonymous], 2008, P CVPR
[9]  
Bo L., 2009, P NIPS
[10]  
Bo L., 2013, P CVPR