Merging Pose Estimates Across Space and Time

被引:13
作者
Burgos-Artizzu, Xavier P. [1 ]
Hall, David [1 ]
Perona, Pietro [1 ]
Dollar, Piotr [2 ]
机构
[1] CALTECH, Pasadena, CA 91125 USA
[2] Microsoft Res, Redmond, WA USA
来源
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013 | 2013年
关键词
D O I
10.5244/C.27.58
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Numerous 'non-maximum suppression' (NMS) post-processing schemes have been proposed for merging multiple independent object detections. We propose a generalization of NMS beyond bounding boxes to merge multiple pose estimates in a single frame. The final estimates are centroids rather than medoids as in standard NMS, thus being more accurate than any of the individual candidates. Using the same mathematical framework, we extend our approach to the multi-frame setting, merging multiple independent pose estimates across space and time and outputting both the number and pose of the objects present in a scene. Our approach sidesteps many of the inherent challenges associated with full tracking (e.g. objects entering/leaving a scene, extended periods of occlusion, etc.). We show its versatility by applying it to two distinct state-of-the-art pose estimation algorithms in three domains: human bodies, faces and mice. Our approach improves both detection accuracy (by helping disambiguate correspondences) as well as pose estimation quality and is computationally efficient.
引用
收藏
页数:11
相关论文
共 26 条
[11]  
Dollar P., 2010, BMVC 2010, DOI DOI 10.5244/C.24.68
[12]   Cascaded Pose Regression [J].
Dollar, Piotr ;
Welinder, Peter ;
Perona, Pietro .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :1078-1085
[13]   2D Articulated Human Pose Estimation and Retrieval in (Almost) Unconstrained Still Images [J].
Eichner, M. ;
Marin-Jimenez, M. ;
Zisserman, A. ;
Ferrari, V. .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 99 (02) :190-214
[14]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[15]   Progressive search space reduction for human pose estimation [J].
Ferrari, Vittorio ;
Marin-Jimenez, Manuel ;
Zisserman, Andrew .
2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008,
[16]  
Gross R., 2008, FG
[17]  
Le V, 2012, LECT NOTES COMPUT SC, V7574, P679, DOI 10.1007/978-3-642-33712-3_49
[18]   A survey of computer vision-based human motion capture [J].
Moeslund, TB ;
Granum, E .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2001, 81 (03) :231-268
[19]   Visual interpretation of hand gestures for human-computer interaction: A review [J].
Pavlovic, VI ;
Sharma, R ;
Huang, TS .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (07) :677-695
[20]  
Peursum Patrick., 2007, CVPR