Poselets: Body Part Detectors Trained Using 3D Human Pose Annotations

被引:413
作者
Bourdev, Lubomir [1 ]
Malik, Jitendra [1 ]
机构
[1] Univ Calif Berkeley, EECS, Berkeley, CA 94720 USA
来源
2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2009年
关键词
D O I
10.1109/ICCV.2009.5459303
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We address the classic problems of detection, segmentation and pose estimation of people in images with a novel definition of a part, a poselet. We postulate two criteria (1) It should be easy to find a poselet given an input image (2) it should be easy to localize the 3D configuration of the person conditioned on the detection of a poselet. To permit this we have built a new dataset, H3D, of annotations of humans in 2D photographs with 3D joint information, inferred using anthropometric constraints. This enables us to implement a data-driven search procedure for finding poselets that are tightly clustered in both 3D joint configuration space as well as 2D image appearance. The algorithm discovers poselets that correspond to frontal and profile faces, pedestrians, head and shoulder views, among others. Each poselet provides examples for training a linear SVM classifier which can then be run over the image in a multiscale scanning mode. The outputs of these poselet detectors can be thought of as an intermediate layer of nodes, on top of which one can run a second layer of classification or regression. We show how this permits detection and localization of torsos or keypoints such as left shoulder, nose, etc. Experimental results show that we obtain state of the art performance on people detection in the PASCAL VOC 2007 challenge, among other datasets. We are making publicly available both the H3D dataset as well as the poselet parameters for use by other researchers.
引用
收藏
页码:1365 / 1372
页数:8
相关论文
共 21 条
  • [1] [Anonymous], 2006, NIPS
  • [2] [Anonymous], 2004, P WORKSH STAT LEARN
  • [3] [Anonymous], 2006, HUMANEVA SYNCHRONIZ
  • [4] [Anonymous], 2009, CVPR
  • [5] [Anonymous], 2008, CVPR
  • [6] [Anonymous], 2005, CVPR
  • [7] Dalal N., CVPR
  • [8] Everingham M., PASCAL VISUAL OBJECT
  • [9] Pictorial structures for object recognition
    Felzenszwalb, PF
    Huttenlocher, DP
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2005, 61 (01) : 55 - 79
  • [10] Progressive search space reduction for human pose estimation
    Ferrari, Vittorio
    Marin-Jimenez, Manuel
    Zisserman, Andrew
    [J]. 2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008,