Multi-component Models for Object Detection

Authors
Gu, Chunhui [1]
Arbelaez, Pablo [2]
Lin, Yuanqing [3]
Yu, Kai [4]
Malik, Jitendra [2]
Affiliations
[1] Google Inc., Mountain View, CA 94043, USA
[2] University of California, Berkeley, Berkeley, CA, USA
[3] NEC Laboratories America, Cupertino, CA, USA
[4] Baidu Inc., Beijing, People's Republic of China
Source
Computer Vision - ECCV 2012, Part IV | 2012 / Vol. 7575
DOI
Not available
Chinese Library Classification number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
In this paper, we propose a multi-component approach for object detection. Rather than attempting to represent an object category with a monolithic model, or pre-defining a reduced set of aspects, we form visual clusters from the data that are tight in appearance and configuration spaces. We train an individual classifier for each component, and then learn a second classifier that operates at the category level by aggregating responses from multiple components. To reduce computation cost during detection, we adopt the idea of object window selection: our segmentation-based selection mechanism produces fewer than 500 windows per image while preserving high object recall. When compared to the leading methods on the challenging PASCAL VOC 2010 dataset, our multi-component approach obtains highly competitive results. Furthermore, unlike monolithic detection methods, our approach allows the transfer of finer-grained semantic information from the components, such as keypoint locations and segmentation masks.
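The two-level scoring described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each component classifier is linear and that the category-level classifier is a linear combination of component responses, neither of which the abstract specifies. The names `component_responses` and `category_score`, and all weights and feature values, are hypothetical.

```python
from typing import List, Sequence, Tuple

# Hypothetical representation: one linear classifier per component,
# stored as (weights, bias). The abstract does not state the classifier type.
Component = Tuple[Sequence[float], float]


def component_responses(features: Sequence[float],
                        components: List[Component]) -> List[float]:
    """Score one candidate window with every component classifier."""
    return [
        sum(w * x for w, x in zip(weights, features)) + bias
        for weights, bias in components
    ]


def category_score(responses: Sequence[float],
                   agg_weights: Sequence[float],
                   agg_bias: float) -> float:
    """Second-level classifier (assumed linear) over component responses."""
    return sum(a * r for a, r in zip(agg_weights, responses)) + agg_bias


# Toy values chosen only for illustration.
components = [([1.0, 0.0], -0.5), ([0.0, 2.0], 0.0)]
window = [1.0, 0.25]

responses = component_responses(window, components)
score = category_score(responses, agg_weights=[0.5, 1.0], agg_bias=0.25)
print(responses, score)
```

In a detector of this shape, the same two calls would be applied to each candidate window that survives window selection, so the per-image cost scales with the number of windows times the number of components.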
Pages: 445-458
Page count: 14