Simultaneously Recovering Multi-Person Meshes and Multi-View Cameras With Human Semantics

被引：3

作者：

Huang, Buzhen ^{[1
,2
]}

Ju, Jingyi ^{[1
,2
]}

Shu, Yuan ^{[1
,2
]}

Wang, Yangang ^{[1
,2
]}

机构：

[1] Southeast Univ, Key Lab Measurement & Control Complex Syst Engn, Minist Educ, Nanjing 210096, Peoples R China

[2] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 06期

基金：

中国国家自然科学基金;

关键词：

Cameras; Calibration; Semantics; Optimization; Three-dimensional displays; Dynamics; Noise measurement; Multi-person mesh recovery; camera calibration; motion prior and cross-view correspondence; MARKERLESS MOTION CAPTURE; POSE ESTIMATION; CALIBRATION; TRACKING; POINTS; SHAPE;

D O I：

10.1109/TCSVT.2023.3328371

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Dynamic multi-person mesh recovery has broad applications in sports broadcasting, virtual reality, and video games. However, current multi-view frameworks rely on a time-consuming camera calibration procedure. In this work, we focus on multi-person motion capture with uncalibrated cameras, which mainly faces two challenges: one is that inter-person interactions and occlusions introduce inherent ambiguities for both camera calibration and motion capture; the other is that a lack of dense correspondences can be used to constrain sparse camera geometries in a dynamic multi-person scene. Our key idea is to incorporate motion prior knowledge to simultaneously estimate camera parameters and human meshes from noisy human semantics. We first utilize human information from 2D images to initialize intrinsic and extrinsic parameters. Thus, the approach does not rely on any other calibration tools or background features. Then, a pose-geometry consistency is introduced to associate the detected humans from different views. Finally, a latent motion prior is proposed to refine the camera parameters and human motions. Experimental results show that accurate camera parameters and human motions can be obtained through a one-step reconstruction. The code are publicly available at https://github.com/boycehbz/DMMR.

引用

页码：4229 / 4242

页数：14

共 81 条

[21] AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time [J].

Fang, Hao-Shu ;

Li, Jiefeng ;

Tang, Hongyang ;

Xu, Chao ;

Zhu, Haoyi ;

Xiu, Yuliang ;

Li, Yong-Lu ;

Lu, Cewu .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) :7157-7173

[22] Single View Physical Distance Estimation using Human Pose [J].

Fei, Xiaohan ;

Wang, Henry ;

Cheong, Lin Lee ;

Zeng, Xiangyu ;

Wang, Meng ;

Tighe, Joseph .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :12386-12396

[23] Self-supervised Multi-view Multi-Human Association and Tracking [J].

Gan, Yiyang ;

Han, Ruize ;

Yin, Liqiang ;

Feng, Wei ;

Wang, Song .

PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, :282-290

[24] Fast automatic camera network calibration through human mesh recovery [J].

Garau, Nicola ;

De Natale, Francesco G. B. ;

Conci, Nicola .

JOURNAL OF REAL-TIME IMAGE PROCESSING, 2020, 17 (06) :1757-1768

[25] Unsupervised continuous camera network pose estimation through human mesh recovery [J].

Garau, Nicola ;

Conci, Nicola .

ICDSC 2019: 13TH INTERNATIONAL CONFERENCE ON DISTRIBUTED SMART CAMERAS, 2019,

[26]

Geman S., 1987, B INT STAT I, V4, P5

[27] Deltille Grids for Geometric Camera Calibration [J].

Ha, Hyowon ;

Perdoch, Michal ;

Alismail, Hatem ;

Kweon, In So ;

Sheikh, Yaser .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5354-5362

[28] Object-Occluded Human Shape and Pose Estimation With Probabilistic Latent Consistency [J].

Huang, Buzhen ;

Zhang, Tianshu ;

Wang, Yangang .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) :5010-5026

[29] Pose2UV: Single-Shot Multiperson Mesh Recovery With Deep UV Prior [J].

Huang, Buzhen ;

Zhang, Tianshu ;

Wang, Yangang .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 :4679-4692

[30] Dynamic Multi-Person Mesh Recovery From Uncalibrated Multi-View Cameras [J].

Huang, Buzhen ;

Shu, Yuan ;

Zhang, Tianshu ;

Wang, Yangang .

2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, :710-720

← 1 2 3 4 5 6 7 8 9 →