A Region-Based Gauss-Newton Approach to Real-Time Monocular Multiple Object Tracking

被引:70
作者
Tjaden, Henning [1 ]
Schwanecke, Ulrich [2 ]
Schoemer, Elmar [3 ]
Cremers, Daniel [4 ]
机构
[1] RheinMain Univ Appl Sci, Comp Vis & MixedReal Grp, D-65197 Wiesbaden, Germany
[2] RheinMain Univ Appl Sci, Comp Graph & Vis, D-65197 Wiesbaden, Germany
[3] Johannes Gutenberg Univ Mainz, Comp Graph & Computat Geometry, D-55122 Mainz, Germany
[4] Tech Univ Munich, D-80333 Munich, Germany
关键词
Pose estimation; tracking; image segmentation; region-based; optimization; dataset; SEGMENTATION; COLOR;
D O I
10.1109/TPAMI.2018.2884990
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose an algorithm for real-time 6DOF pose tracking of rigid 3D objects using a monocular RGB camera. The key idea is to derive a region-based cost function using temporally consistent local color histograms. While such region-based cost functions are commonly optimized using first-order gradient descent techniques, we systematically derive a Gauss-Newton optimization scheme which gives rise to drastically faster convergence and highly accurate and robust tracking performance. We furthermore propose a novel complex dataset dedicated for the task of monocular object pose tracking and make it publicly available to the community. To our knowledge, it is the first to address the common and important scenario in which both the camera as well as the objects are moving simultaneously in cluttered scenes. In numerous experiments-including our own proposed dataset-we demonstrate that the proposed Gauss-Newton approach outperforms existing approaches, in particular in the presence of cluttered backgrounds, heterogeneous objects and partial occlusions.
引用
收藏
页码:1797 / 1812
页数:16
相关论文
共 41 条
[1]  
[Anonymous], 2015, ROBOTICS SCI SYSTEMS
[2]  
[Anonymous], IEEE I CONF COMP VIS
[3]  
[Anonymous], 2013, PROC ASIAN C COMPUT
[4]  
[Anonymous], 2014, ACCV
[5]  
[Anonymous], P ECCV WORKSH REC 6D
[6]  
Bibby C, 2008, LECT NOTES COMPUT SC, V5303, P831, DOI 10.1007/978-3-540-88688-4_61
[7]   Uncertainty-Driven 6D Pose Estimation of Objects and Scenes from a Single RGB Image [J].
Brachmann, Eric ;
Michel, Frank ;
Krull, Alexander ;
Yang, Michael Ying ;
Gumhold, Stefan ;
Rother, Carsten .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3364-3372
[8]   Combined Region and Motion-Based 3D Tracking of Rigid and Articulated Objects [J].
Brox, Thomas ;
Rosenhahn, Bodo ;
Gall, Juergen ;
Cremers, Daniel .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (03) :402-415
[9]  
Choi C, 2013, IEEE INT C INT ROBOT, P1084, DOI 10.1109/IROS.2013.6696485
[10]   A review of statistical approaches to level set segmentation: Integrating color, texture, motion and shape [J].
Cremers, Daniel ;
Rousson, Mikael ;
Deriche, Rachid .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2007, 72 (02) :195-215