Bayesian 3D tracking from monocular video

Cited by: 8
|
Authors
Brau, Ernesto [1]
Guan, Jinyan [1]
Simek, Kyle [1]
Del Pero, Luca [3]
Dawson, Colin Reimer [2]
Barnard, Kobus [2]
Affiliations
[1] Univ Arizona, Comp Sci, Tucson, AZ 85721 USA
[2] Univ Arizona, Sch Informat, Tucson, AZ 85721 USA
[3] Univ Edinburgh, Sch Informat, Edinburgh EH8 9YL, Midlothian, Scotland
Source
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2013
DOI
10.1109/ICCV.2013.418
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We develop a Bayesian modeling approach for tracking people in 3D from monocular video with unknown cameras. Modeling in 3D provides natural explanations for occlusions and smoothness discontinuities that result from projection, and allows priors on velocity and smoothness to be grounded in physical quantities: meters and seconds vs. pixels and frames. We pose the problem in the context of data association, in which observations are assigned to tracks. A correct application of Bayesian inference to multitarget tracking must address the fact that the model's dimension changes as tracks are added or removed, and thus, posterior densities of different hypotheses are not comparable. We address this by marginalizing out the trajectory parameters so the resulting posterior over data associations has constant dimension. This is made tractable by using (a) Gaussian process priors for smooth trajectories and (b) approximately Gaussian likelihood functions. Our approach provides a principled method for incorporating multiple sources of evidence; we present results using both optical flow and object detector outputs. Results are comparable to recent work on 3D tracking and, unlike others, our method requires no pre-calibrated cameras.
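The marginalization step described in the abstract can be illustrated with a toy sketch. It is not the authors' implementation: it assumes 1D positions, a zero-mean Gaussian process prior with a squared-exponential kernel, and exactly Gaussian observation noise, and it omits the prior over associations. All function names and parameter values are hypothetical. Under these assumptions each track's trajectory integrates out in closed form, so every association hypothesis is scored by a single number, however many tracks it contains.

```python
import math


def sq_exp_kernel(t1, t2, signal_var=1.0, length_scale=2.0):
    """Squared-exponential covariance between two time stamps (assumed kernel)."""
    d = t1 - t2
    return signal_var * math.exp(-0.5 * d * d / (length_scale ** 2))


def cholesky(a):
    """Lower-triangular Cholesky factor of a symmetric positive-definite matrix."""
    n = len(a)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = a[i][j] - sum(low[i][k] * low[j][k] for k in range(j))
            low[i][j] = math.sqrt(s) if i == j else s / low[j][j]
    return low


def log_marginal_likelihood(times, obs, noise_var=0.01):
    """log N(obs; 0, K + noise_var * I): the trajectory is integrated out."""
    n = len(times)
    if n == 0:
        return 0.0
    cov = [[sq_exp_kernel(times[i], times[j]) + (noise_var if i == j else 0.0)
            for j in range(n)] for i in range(n)]
    low = cholesky(cov)
    # Forward substitution: solve low * a = obs, so that a.a = obs^T cov^-1 obs.
    a = []
    for i in range(n):
        a.append((obs[i] - sum(low[i][k] * a[k] for k in range(i))) / low[i][i])
    quad = sum(v * v for v in a)
    log_det = 2.0 * sum(math.log(low[i][i]) for i in range(n))
    return -0.5 * (quad + log_det + n * math.log(2.0 * math.pi))


def association_score(tracks):
    """Sum of per-track log marginal likelihoods for one association hypothesis.

    `tracks` is a list of (times, observations) pairs, one pair per track.
    """
    return sum(log_marginal_likelihood(t, z) for t, z in tracks)


times = [0.0, 1.0, 2.0, 3.0]
# Hypothesis 1: each track receives a smooth sequence of observations.
smooth = [(times, [0.0, 0.1, 0.2, 0.3]), (times, [2.0, 1.9, 1.8, 1.7])]
# Hypothesis 2: the two observations at t = 2 are swapped between tracks.
swapped = [(times, [0.0, 0.1, 1.8, 0.3]), (times, [2.0, 1.9, 0.2, 1.7])]

print(association_score(smooth) > association_score(swapped))
```

Because the scores are marginal likelihoods rather than densities over trajectory parameters, a hypothesis that splits the same observations into three tracks could be passed to `association_score` and compared directly with the two-track hypotheses.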
Pages: 3368-3375
Number of pages: 8
Related papers
50 in total
  • [1] Markerless 3D human motion tracking for monocular video sequences
    Zou, Beiji
    Chen, Shu
    Peng, Xiaoning
    Shi, Cao
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2008, 20 (08) : 1047 - 1055
  • [2] Feasibility of 3D Body Tracking from Monocular 2D Video Feeds in Musculoskeletal Telerehabilitation
    Clemente, Carolina
    Chambel, Goncalo
    Silva, Diogo C. F.
    Montes, Antonio Mesquita
    Pinto, Joana F.
    da Silva, Hugo Placido
    SENSORS, 2024, 24 (01)
  • [3] Corrective 3D Reconstruction of Lips from Monocular Video
    Garrido, Pablo
    Zollhoefer, Michael
    Wu, Chenglei
    Bradley, Derek
    Perez, Patrick
    Beeler, Thabo
    Theobalt, Christian
    ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (06)
  • [4] PERGAMO: Personalized 3D Garments from Monocular Video
    Casado-Elvira, Andres
    Trinidad, Marc Comino
    Casas, Dan
    COMPUTER GRAPHICS FORUM, 2022, 41 (08) : 293 - 304
  • [5] 3D Motion and Skeleton Construction from Monocular Video
    Azmi, Nik Mohammad Wafiy
    Albakri, Ikmal Faiq
    Suaib, Norhaida Mohd
    Rahim, Mohd Shafry Mohd
    Yu, Hongchuan
    COMPUTATIONAL SCIENCE AND TECHNOLOGY (ICCST 2019), 2020, 603 : 75 - 84
  • [6] Robust 3D arm tracking from monocular videos
    Guo, F
    Qian, G
    ADVANCES IN INTELLIGENT COMPUTING, PT 2, PROCEEDINGS, 2005, 3645 : 841 - 850
  • [7] Creating stereoscopic (3D) video from a 2D monocular video stream
    Li, Xiaokun
    Xu, Roger
    Zhou, Jin
    Li, Baoxin
    ADVANCES IN VISUAL COMPUTING, PT I, 2007, 4841 : 258 - +
  • [8] Monocular 3D Tracking of Deformable Surfaces
    Puig, Luis
    Daniilidis, Kostas
    2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016 : 580 - 586
  • [9] Recognition and 3D Localization of Pedestrian Actions from Monocular Video
    Hayakawa, Jun
    Dariush, Behzad
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020
  • [10] 3D environment capture from monocular video and inertial data
    Clark, R. Robert
    Lin, Michael H.
    Taylor, Colin J.
    THREE-DIMENSIONAL IMAGE CAPTURE AND APPLICATIONS VII, 2006, 6056