Region-Based Saliency Detection and Its Application in Object Recognition

被引:292
作者
Ren, Zhixiang [1 ]
Gao, Shenghua [2 ]
Chia, Liang-Tien [1 ]
Tsang, Ivor Wai-Hung [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Nanyang 639798, Singapore
[2] Adv Digital Sci Ctr, Singapore, Singapore
关键词
Object recognition; saliency detection; saliency propagation; superpixel; weighted sparse coding; VISUAL-ATTENTION; MODEL; FEATURES;
D O I
10.1109/TCSVT.2013.2280096
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The objective of this paper is twofold. First, we introduce an effective region-based solution for saliency detection. Then, we apply the achieved saliency map to better encode the image features for solving object recognition task. To find the perceptually and semantically meaningful salient regions, we extract superpixels based on an adaptive mean shift algorithm as the basic elements for saliency detection. The saliency of each superpixel is measured by using its spatial compactness, which is calculated according to the results of Gaussian mixture model (GMM) clustering. To propagate saliency between similar clusters, we adopt a modified PageRank algorithm to refine the saliency map. Our method not only improves saliency detection through large salient region detection and noise tolerance in messy background, but also generates saliency maps with a well-defined object shape. Experimental results demonstrate the effectiveness of our method. Since the objects usually correspond to salient regions, and these regions usually play more important roles for object recognition than background, we apply our achieved saliency map for object recognition by incorporating a saliency map into sparse coding-based spatial pyramid matching (ScSPM) image representation. To learn a more discriminative codebook and better encode the features corresponding to the patches of the objects, we propose a weighted sparse coding for feature coding. Moreover, we also propose a saliency weighted max pooling to further emphasize the importance of those salient regions in feature pooling module. Experimental results on several datasets illustrate that our weighted ScSPM framework greatly outperforms ScSPM framework, and achieves excellent performance for object recognition.
引用
收藏
页码:769 / 779
页数:11
相关论文
共 69 条
[1]  
Achanta R, 2009, PROC CVPR IEEE, P1597, DOI 10.1109/CVPRW.2009.5206596
[2]  
Alexe B, 2010, PROC CVPR IEEE, P73, DOI 10.1109/CVPR.2010.5540226
[3]  
[Anonymous], 2006, ADV NEURAL INF PROCE
[4]  
[Anonymous], 2011, P 17 ACM SIGKDD INT, DOI DOI 10.1145/2020408
[5]  
[Anonymous], 2011, P IEEE MTT S INT MIC
[6]  
[Anonymous], 2008, 2008 19 INT C PATT R
[7]  
[Anonymous], 2004, Advances in neural information processing systems
[8]  
[Anonymous], 2009, P IEEE C COMP VIS PA
[9]  
[Anonymous], 2007, P 15 ACM INT C MULT
[10]  
[Anonymous], 2007, PROC IEEE C COMPUT V, DOI 10.1109/CVPR.2007.383267