3D human pose estimation in multi-view operating room videos using differentiable camera projections

被引:2
|
作者
Gerats, Beerend G. A. [1 ,2 ,5 ]
Wolterink, Jelmer M. [3 ,4 ]
Broeders, Ivo A. M. J. [1 ,2 ]
机构
[1] Meander Med Ctr, Ctr Artificial Intelligence, Amersfoort, Netherlands
[2] Univ Twente, Robot & Mechatron, Enschede, Netherlands
[3] Univ Twente, Dept Appl Math, Enschede, Netherlands
[4] Univ Twente, Tech Med Ctr, Enschede, Netherlands
[5] Meander Med Ctr, Ctr Artificial Intelligence, Maatweg 3, NL-3813 TZ Amersfoort, Netherlands
来源
COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION | 2023年 / 11卷 / 04期
关键词
Human pose estimation; operating room; differentiable camera projection;
D O I
10.1080/21681163.2022.2155580
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
3D human pose estimation in multi-view operating room (OR) videos is a relevant asset for person tracking and action recognition. However, the surgical environment makes it challenging to find poses due to sterile clothing, frequent occlusions and limited public data. Methods specifically designed for the OR are generally based on the fusion of detected poses in multiple camera views. Typically, a 2D pose estimator such as a convolutional neural network (CNN) detects joint locations. Then, the detected joint locations are projected to 3D and fused over all camera views. However, accurate detection in 2D does not guarantee accurate localisation in 3D space. In this work, we propose to directly optimise for localisation in 3D by training 2D CNNs end-to-end based on a 3D loss that is backpropagated through each camera's projection parameters. Using videos from the MVOR dataset, we show that this end-to-end approach outperforms optimisation in 2D space.
引用
收藏
页码:1197 / 1205
页数:9
相关论文
共 50 条
  • [1] Multi-view Pictorial Structures for 3D Human Pose Estimation
    Amin, Sikandar
    Andriluka, Mykhaylo
    Rohrbach, Marcus
    Schiele, Bernt
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,
  • [2] Multi-view 3D Human Pose Estimation in Complex Environment
    M. Hofmann
    D. M. Gavrila
    International Journal of Computer Vision, 2012, 96 : 103 - 124
  • [3] Generative Multi-View Based 3D Human Pose Estimation
    Sabri, Motaz
    PROCEEDINGS OF 2021 INTERNATIONAL CONFERENCE ON SUSTAINABLE INFORMATION ENGINEERING AND TECHNOLOGY, SIET 2021, 2021, : 2 - 9
  • [4] PROGRESSIVE MULTI-VIEW FUSION FOR 3D HUMAN POSE ESTIMATION
    Zhang, Lijun
    Zhou, Kangkang
    Liu, Liangchen
    Li, Zhenghao
    Zhao, Xunyi
    Zhou, Xiang-Dong
    Shi, Yu
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1600 - 1604
  • [5] Multi-view 3D Human Pose Estimation in Complex Environment
    Hofmann, M.
    Gavrila, D. M.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 96 (01) : 103 - 124
  • [6] Markerless multi-view 3D human pose estimation: A survey
    Nogueira, Ana Filipa Rodrigues
    Oliveira, Helder P.
    Teixeira, Luis F.
    IMAGE AND VISION COMPUTING, 2025, 155
  • [7] Probabilistic Triangulation for Uncalibrated Multi-View 3D Human Pose Estimation
    Jiang, Boyuan
    Hu, Lei
    Xia, Shihong
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 14804 - 14814
  • [8] 3D Human Pose Estimation from Deep Multi-View 2D Pose
    Schwarcz, Steven
    Pollard, Thomas
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2326 - 2331
  • [9] Lightweight Multi-View 3D Pose Estimation through Camera-Disentangled Representation
    Remelli, Edoardo
    Han, Shangchen
    Honari, Sina
    Fua, Pascal
    Wang, Robert
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6039 - 6048
  • [10] Multi-View Pose Generator Based on Deep Learning for Monocular 3D Human Pose Estimation
    Sun, Jun
    Wang, Mantao
    Zhao, Xin
    Zhang, Dejun
    SYMMETRY-BASEL, 2020, 12 (07):