Weakly Supervised Learning of Deformable Part-Based Models for Object Detection via Region Proposals

被引:39
作者
Tang, Yuxing [1 ]
Wang, Xiaofang [1 ]
Dellandrea, Emmanuel [1 ]
Chen, Liming [1 ]
机构
[1] Ecole Cent Lyon, LIRIS, CNRS, UMR 5205, F-69134 Ecully, France
关键词
Deformable part-based models ( DPMs); object detection; region proposals; weakly supervised learning; LOCALIZATION; HISTOGRAMS; GRADIENTS;
D O I
10.1109/TMM.2016.2614862
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The success of deformable part-based models (DPMs) for visual object detection relies on a large number of labeled bounding boxes. With only image-level annotations, our goal is to propose a model enhancing the weakly supervised DPMs by emphasizing the importance of location and size of the initial class- specific root filter. To adaptively select a discriminative set of candidate bounding boxes as this root filter estimate, first, we explore the generic objectness measurement to combine the most salient regions and "good" region proposals. Second, we propose learning of the latent class label of each candidate window as a binary classification problem, by training category- specific classifiers used to coarsely classify a candidate window into either a target object or a nontarget class. Finally, we design a flexible enlarging-and-shrinking postprocessing procedure to modify the DPMs outputs, which can effectively match the approximative object aspect ratios and further improve final accuracy. Extensive experimental results on the challenging PASCAL Visual Object Class 2007 and the Microsoft Common Objects in Context 2014 dataset demonstrate that our proposed framework is effective for initialization of the DPM's root filter. It also shows competitive final localization performance with state-of-the-art weakly supervised object detectionmethods, particularly for the object categories that are relatively salient in the images and deformable in structures.
引用
收藏
页码:393 / 407
页数:15
相关论文
共 59 条
[1]   Measuring the Objectness of Image Windows [J].
Alexe, Bogdan ;
Deselaers, Thomas ;
Ferrari, Vittorio .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) :2189-2202
[2]  
[Anonymous], P EUR C COMPUT VIS
[3]  
[Anonymous], 2006, Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference on, DOI DOI 10.1109/CVPR.2006.303
[4]   Multiscale Combinatorial Grouping [J].
Arbelaez, Pablo ;
Pont-Tuset, Jordi ;
Barron, Jonathan T. ;
Marques, Ferran ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :328-335
[5]  
Azizpour H, 2012, LECT NOTES COMPUT SC, V7572, P836, DOI 10.1007/978-3-642-33718-5_60
[6]  
Bilen H, 2014, P BMVC 2014, P112
[7]  
Bilen H, 2015, PROC CVPR IEEE, P1081, DOI 10.1109/CVPR.2015.7298711
[8]   CPMC: Automatic Object Segmentation Using Constrained Parametric Min-Cuts [J].
Carreira, Joao ;
Sminchisescu, Cristian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (07) :1312-1328
[9]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[10]   BING: Binarized Normed Gradients for Objectness Estimation at 300fps [J].
Cheng, Ming-Ming ;
Zhang, Ziming ;
Lin, Wen-Yan ;
Torr, Philip .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3286-3293