Bayesian 3D tracking from monocular video

被引:8
作者
Brau, Ernesto [1 ]
Guan, Jinyan [1 ]
Simek, Kyle [1 ]
Del Pero, Luca [3 ]
Dawson, Colin Reimer [2 ]
Barnard, Kobus [2 ]
机构
[1] Univ Arizona, Comp Sci, Tucson, AZ 85721 USA
[2] Univ Arizona, Sch Informat, Tucson, AZ 85721 USA
[3] Univ Edinburgh, Sch Informat, Edinburgh EH8 9YL, Midlothian, Scotland
来源
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2013年
关键词
D O I
10.1109/ICCV.2013.418
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We develop a Bayesian modeling approach for tracking people in 3D from monocular video with unknown cameras. Modeling in 3D provides natural explanations for occlusions and smoothness discontinuities that result from projection, and allows priors on velocity and smoothness to be grounded in physical quantities: meters and seconds vs. pixels and frames. We pose the problem in the context of data association, in which observations are assigned to tracks. A correct application of Bayesian inference to multitarget tracking must address the fact that the model's dimension changes as tracks are added or removed, and thus, posterior densities of different hypotheses are not comparable. We address this by marginalizing out the trajectory parameters so the resulting posterior over data associations has constant dimension. This is made tractable by using (a) Gaussian process priors for smooth trajectories and (b) approximately Gaussian likelihood functions. Our approach provides a principled method for incorporating multiple sources of evidence; we present results using both optical flow and object detector outputs. Results are comparable to recent work on 3D tracking and, unlike others, our method requires no pre-calibrated cameras.
引用
收藏
页码:3368 / 3375
页数:8
相关论文
共 50 条
  • [21] Monocular 3D Pose Estimation and Tracking by Detection
    Andriluka, Mykhaylo
    Roth, Stefan
    Schiele, Bernt
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 623 - 630
  • [22] BAYESIAN BASED 3D SHAPE RECONSTRUCTION FROM VIDEO
    Ghosh, Nirmalya
    Bhanu, Bir
    2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 1152 - 1155
  • [23] Monocular 3D Pose Tracking of a Specular Object
    Oumer, Nassir W.
    PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 3, 2014, : 458 - 465
  • [24] Joint Monocular 3D Vehicle Detection and Tracking
    Hu, Hou-Ning
    Cai, Qi-Zhi
    Wang, Dequan
    Lin, Ji
    Sun, Min
    Krahenbuhl, Philipp
    Darrell, Trevor
    Yu, Fisher
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5389 - 5398
  • [25] Monocular 3D Tracking of Multiple Interacting Targets
    Osawa, Tatsuya
    Sudo, Kyoko
    Arai, Hiroyuki
    Koike, Hideki
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 3934 - 3937
  • [26] 3D Reconstruction of Human Motion and Skeleton from Uncalibrated Monocular Video
    Chen, Yen-Lin
    Chai, Jinxiang
    COMPUTER VISION - ACCV 2009, PT I, 2010, 5994 : 71 - 82
  • [27] Towards robust 3D reconstruction of human motion from monocular video
    Chen, Cheng
    Zhuang, Yueting
    Xiao, Jun
    Advances in Artificial Reality and Tele-Existence, Proceedings, 2006, 4282 : 594 - 603
  • [28] 3D scene reconstruction from monocular spherical video with motion parallax
    Tanaka, Kenji
    2022 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY ADJUNCT (ISMAR-ADJUNCT 2022), 2022, : 191 - 197
  • [29] Tracking and matching connected components from 3D video
    Pires, DD
    Cesar, RM
    Vieira, MB
    Velho, L
    SIBGRAPI 2005: XVIII BRAZILIAN SYMPOSIUM ON COMPUTER GRAPHICS AND IMAGE PROCESSING, CONFERENCE PROCEEDINGS, 2005, : 257 - 264
  • [30] Bridging the gap between detection and tracking for 3D monocular video-based motion capture
    Fossati, Andrea
    Dimitrijevic, Miodrag
    Lepetit, Vincent
    Fua, Pascal
    2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 2510 - +