Bayesian 3D tracking from monocular video

被引:8
作者
Brau, Ernesto [1 ]
Guan, Jinyan [1 ]
Simek, Kyle [1 ]
Del Pero, Luca [3 ]
Dawson, Colin Reimer [2 ]
Barnard, Kobus [2 ]
机构
[1] Univ Arizona, Comp Sci, Tucson, AZ 85721 USA
[2] Univ Arizona, Sch Informat, Tucson, AZ 85721 USA
[3] Univ Edinburgh, Sch Informat, Edinburgh EH8 9YL, Midlothian, Scotland
来源
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2013年
关键词
D O I
10.1109/ICCV.2013.418
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We develop a Bayesian modeling approach for tracking people in 3D from monocular video with unknown cameras. Modeling in 3D provides natural explanations for occlusions and smoothness discontinuities that result from projection, and allows priors on velocity and smoothness to be grounded in physical quantities: meters and seconds vs. pixels and frames. We pose the problem in the context of data association, in which observations are assigned to tracks. A correct application of Bayesian inference to multitarget tracking must address the fact that the model's dimension changes as tracks are added or removed, and thus, posterior densities of different hypotheses are not comparable. We address this by marginalizing out the trajectory parameters so the resulting posterior over data associations has constant dimension. This is made tractable by using (a) Gaussian process priors for smooth trajectories and (b) approximately Gaussian likelihood functions. Our approach provides a principled method for incorporating multiple sources of evidence; we present results using both optical flow and object detector outputs. Results are comparable to recent work on 3D tracking and, unlike others, our method requires no pre-calibrated cameras.
引用
收藏
页码:3368 / 3375
页数:8
相关论文
共 50 条
[21]   BAYESIAN BASED 3D SHAPE RECONSTRUCTION FROM VIDEO [J].
Ghosh, Nirmalya ;
Bhanu, Bir .
2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, :1152-1155
[22]   Silhouette lookup for monocular 3D pose tracking [J].
Howe, Nicholas R. .
IMAGE AND VISION COMPUTING, 2007, 25 (03) :331-341
[23]   Monocular 3D Pose Estimation and Tracking by Detection [J].
Andriluka, Mykhaylo ;
Roth, Stefan ;
Schiele, Bernt .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :623-630
[24]   Monocular 3D Pose Tracking of a Specular Object [J].
Oumer, Nassir W. .
PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 3, 2014, :458-465
[25]   Joint Monocular 3D Vehicle Detection and Tracking [J].
Hu, Hou-Ning ;
Cai, Qi-Zhi ;
Wang, Dequan ;
Lin, Ji ;
Sun, Min ;
Krahenbuhl, Philipp ;
Darrell, Trevor ;
Yu, Fisher .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :5389-5398
[26]   Monocular 3D Tracking of Multiple Interacting Targets [J].
Osawa, Tatsuya ;
Sudo, Kyoko ;
Arai, Hiroyuki ;
Koike, Hideki .
19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, :3934-3937
[27]   3D Reconstruction of Human Motion and Skeleton from Uncalibrated Monocular Video [J].
Chen, Yen-Lin ;
Chai, Jinxiang .
COMPUTER VISION - ACCV 2009, PT I, 2010, 5994 :71-82
[28]   In-Hand 3D Object Reconstruction from a Monocular RGB Video [J].
Jiang, Shijian ;
Ye, Qi ;
Xie, Rengan ;
Huo, Yuchi ;
Li, Xiang ;
Zhou, Yang ;
Chen, Jiming .
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 3, 2024, :2525-2533
[29]   Towards robust 3D reconstruction of human motion from monocular video [J].
Chen, Cheng ;
Zhuang, Yueting ;
Xiao, Jun .
Advances in Artificial Reality and Tele-Existence, Proceedings, 2006, 4282 :594-603
[30]   3D scene reconstruction from monocular spherical video with motion parallax [J].
Tanaka, Kenji .
2022 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY ADJUNCT (ISMAR-ADJUNCT 2022), 2022, :191-197