3D human pose estimation in multi-view operating room videos using differentiable camera projections

Cited by: 2

Authors:

Gerats, Beerend G. A. ^{[1,2,5]}

Wolterink, Jelmer M. ^{[3,4]}

Broeders, Ivo A. M. J. ^{[1,2]}

Affiliations:

[1] Meander Med Ctr, Ctr Artificial Intelligence, Amersfoort, Netherlands

[2] Univ Twente, Robot & Mechatron, Enschede, Netherlands

[3] Univ Twente, Dept Appl Math, Enschede, Netherlands

[4] Univ Twente, Tech Med Ctr, Enschede, Netherlands

[5] Meander Med Ctr, Ctr Artificial Intelligence, Maatweg 3, NL-3813 TZ Amersfoort, Netherlands

Source:

COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION | 2023, Vol. 11, Issue 04

Keywords:

Human pose estimation; operating room; differentiable camera projection

DOI:

10.1080/21681163.2022.2155580

Chinese Library Classification (CLC) number:

R318 [Biomedical Engineering]

Discipline classification code:

0831

Abstract:

3D human pose estimation in multi-view operating room (OR) videos is a relevant asset for person tracking and action recognition. However, the surgical environment makes it challenging to find poses due to sterile clothing, frequent occlusions and limited public data. Methods specifically designed for the OR are generally based on the fusion of detected poses in multiple camera views. Typically, a 2D pose estimator such as a convolutional neural network (CNN) detects joint locations. Then, the detected joint locations are projected to 3D and fused over all camera views. However, accurate detection in 2D does not guarantee accurate localisation in 3D space. In this work, we propose to directly optimise for localisation in 3D by training 2D CNNs end-to-end based on a 3D loss that is backpropagated through each camera's projection parameters. Using videos from the MVOR dataset, we show that this end-to-end approach outperforms optimisation in 2D space.
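
The pipeline outlined in the abstract (2D joint detections per view, lifted to 3D and fused through the camera parameters, then scored with a 3D loss) can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: linear (DLT) triangulation is swapped in here as one common fusion step, and the camera matrices, the joint position and the function names (`project`, `triangulate`, `loss_3d`) are hypothetical. Every step is built from differentiable operations, so in an automatic-differentiation framework the same 3D error could in principle be backpropagated to the 2D detector.

```python
import numpy as np

def project(P, X):
    """Project a 3D point X (3,) to pixel coordinates with a 3x4 camera matrix P."""
    x_h = P @ np.append(X, 1.0)
    return x_h[:2] / x_h[2]

def triangulate(Ps, xs):
    """Linear (DLT) triangulation of one joint from its 2D locations in several views."""
    rows = []
    for P, (u, v) in zip(Ps, xs):
        rows.append(u * P[2] - P[0])  # constraint from the u coordinate
        rows.append(v * P[2] - P[1])  # constraint from the v coordinate
    A = np.stack(rows)
    _, _, Vt = np.linalg.svd(A)
    X_h = Vt[-1]                      # right singular vector of the smallest singular value
    return X_h[:3] / X_h[3]

def loss_3d(Ps, xs, X_gt):
    """Euclidean distance between the triangulated joint and the 3D ground truth."""
    return np.linalg.norm(triangulate(Ps, xs) - X_gt)

# Hypothetical setup: three cameras sharing intrinsics K, each shifted by a translation t.
K = np.array([[1000.0, 0.0, 320.0],
              [0.0, 1000.0, 240.0],
              [0.0, 0.0, 1.0]])
translations = [np.zeros(3), np.array([-1.0, 0.0, 0.0]), np.array([0.0, -1.0, 0.0])]
Ps = [K @ np.hstack([np.eye(3), t[:, None]]) for t in translations]

X_gt = np.array([0.2, -0.1, 4.0])           # assumed ground-truth joint position
xs = [project(P, X_gt) for P in Ps]         # perfect 2D detections
X_clean = triangulate(Ps, xs)

# A 5-pixel detection error in a single view already shifts the fused 3D estimate.
xs_noisy = [x.copy() for x in xs]
xs_noisy[0][0] += 5.0
print("3D error, perfect 2D:", loss_3d(Ps, xs, X_gt))
print("3D error, noisy 2D:  ", loss_3d(Ps, xs_noisy, X_gt))
```

The comparison of the two printed errors mirrors the abstract's point that a loss defined on 2D detections and a loss defined on the fused 3D position measure different things.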


Pages: 1197-1205

Page count: 9

