Bayesian 3D tracking from monocular video

被引:8
作者
Brau, Ernesto [1 ]
Guan, Jinyan [1 ]
Simek, Kyle [1 ]
Del Pero, Luca [3 ]
Dawson, Colin Reimer [2 ]
Barnard, Kobus [2 ]
机构
[1] Univ Arizona, Comp Sci, Tucson, AZ 85721 USA
[2] Univ Arizona, Sch Informat, Tucson, AZ 85721 USA
[3] Univ Edinburgh, Sch Informat, Edinburgh EH8 9YL, Midlothian, Scotland
来源
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2013年
关键词
D O I
10.1109/ICCV.2013.418
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We develop a Bayesian modeling approach for tracking people in 3D from monocular video with unknown cameras. Modeling in 3D provides natural explanations for occlusions and smoothness discontinuities that result from projection, and allows priors on velocity and smoothness to be grounded in physical quantities: meters and seconds vs. pixels and frames. We pose the problem in the context of data association, in which observations are assigned to tracks. A correct application of Bayesian inference to multitarget tracking must address the fact that the model's dimension changes as tracks are added or removed, and thus, posterior densities of different hypotheses are not comparable. We address this by marginalizing out the trajectory parameters so the resulting posterior over data associations has constant dimension. This is made tractable by using (a) Gaussian process priors for smooth trajectories and (b) approximately Gaussian likelihood functions. Our approach provides a principled method for incorporating multiple sources of evidence; we present results using both optical flow and object detector outputs. Results are comparable to recent work on 3D tracking and, unlike others, our method requires no pre-calibrated cameras.
引用
收藏
页码:3368 / 3375
页数:8
相关论文
共 50 条
[41]   Kinematic jump processes for monocular 3D human tracking [J].
Sminchisescu, C ;
Triggs, B .
2003 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2003, :69-76
[42]   The Monocular Model-based 3D Pose Tracking [J].
Tong, Guofeng ;
Liu, Ran ;
Li, Hairong .
PROCEEDINGS OF THE 2012 24TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2012, :980-985
[43]   Monocular Multiview Object Tracking with 3D Aspect Parts [J].
Xiang, Yu ;
Song, Changkyu ;
Mottaghi, Roozbeh ;
Savarese, Silvio .
COMPUTER VISION - ECCV 2014, PT VI, 2014, 8694 :220-235
[44]   3D Tracking and Classification System Using a Monocular Camera [J].
George Bardas ;
Stefanos Astaras ;
Sotirios Diamantas ;
Aristodemos Pnevmatikakis .
Wireless Personal Communications, 2017, 92 :63-85
[45]   Covariance scaled sampling for monocular 3D body tracking [J].
Sminchisescu, C ;
Triggs, B .
2001 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2001, :447-454
[46]   3D Tracking and Classification System Using a Monocular Camera [J].
Bardas, George ;
Astaras, Stefanos ;
Diamantas, Sotirios ;
Pnevmatikakis, Aristodemos .
WIRELESS PERSONAL COMMUNICATIONS, 2017, 92 (01) :63-85
[47]   Monocular Semantic Mapping Based on 3D Cuboids Tracking [J].
Ji, Xingwu ;
Gong, Zheng ;
Miao, Ruihang ;
Xue, Wuyang ;
Ying, Rendong .
2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
[48]   Bayesian detection and tracking of odontocetes in 3D from their echolocation clicks [J].
Jang, Junsu ;
Meyer, Florian ;
Snyder, Eric ;
Wiggins, Sean ;
Baumann-Pickering, Simone ;
Hildebrand, John .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 153 (03)
[49]   MotioNet: 3D Human motion reconstruction from monocular video with skeleton consistency [J].
Shi, Mingyi ;
Aberman, Kfir ;
Aristidou, Andreas ;
Komura, Taku ;
Lischinski, Dani ;
Cohen-Or, Daniel ;
Chen, Baoquan .
ACM Transactions on Graphics, 2020, 40 (01)
[50]   Uncertainty-Aware 3D Human Pose Estimation from Monocular Video [J].
Zhang, Jinlu ;
Chen, Yujin ;
Tu, Zhigang .
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, :5102-5113