High-precision human body acquisition via multi-view binocular stereopsis

被引:9
作者
Ran, Qing [1 ]
Zhou, Kaimao [1 ]
Yang, Yong-Liang [2 ]
Kang, Junpeng [1 ]
Zhu, Linan [1 ]
Tang, Yizhi [1 ]
Feng, Jieqing [1 ]
机构
[1] Zhejiang Univ, State Key Lab CAD & CG, Hangzhou 310058, Peoples R China
[2] Univ Bath, Bath BA2 7AY, Avon, England
来源
COMPUTERS & GRAPHICS-UK | 2020年 / 87卷
基金
中国国家自然科学基金; 英国工程与自然科学研究理事会;
关键词
Human body modeling; Acquisition system; Stereo matching; Anthropometry; SURFACE RECONSTRUCTION; HUMAN POSE; SHAPE;
D O I
10.1016/j.cag.2020.01.003
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
It remains challenging how to acquire a human body shape with high precision and evaluate the reconstructed models effectively, because the results can be easily affected by various factors (e.g., the performance of the capture device, the unwanted movement of the subject, and the self-occlusion of the articulated body structure). To tackle the above challenges, this research presents a passive acquisition system, which comprises 60 spatially-configured Digital Single Lens Reflex (DSLR) cameras and a carefully devised algorithmic pipeline for shape acquisition in a single shot. Different from traditional multi-view stereo solutions, the constituent cameras are synchronized and organized into 30 binocular stereo rigs to capture images from multiple views simultaneously. Each binocular stereo rig is regarded as a depth sensor. The acquisition pipeline consists of three stages. First, camera calibration is performed to estimate intrinsic and extrinsic parameters of all cameras, especially for paired binocular cameras. Second, depth inference based on stereo matching is employed to recover reliable depth information from RGB images. A novel hierarchical seed-propagation stereo matching framework is proposed, resulting in 30 dense and uniform-distributed partial point clouds. Finally, a point-based geometry processing step composed of multi-view registration and surface meshing is carried out to obtain high-quality watertight human body shapes. This research also proposes an elaborate and novel method to assess the accuracy of reconstructed non-rigid human body model based on anthropometry parameters, which solves the synchronization of the ground-truth values and the measured values. Experimental results show that the system can achieve the reconstruction accuracy within 2.5 mm in average. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页码:43 / 61
页数:19
相关论文
共 29 条
  • [11] Simultaneously Recovering Multi-Person Meshes and Multi-View Cameras With Human Semantics
    Huang, Buzhen
    Ju, Jingyi
    Shu, Yuan
    Wang, Yangang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 4229 - 4242
  • [12] DS-MVSNet: Unsupervised Multi-view Stereo via Depth Synthesis
    Li, Jingliang
    Lu, Zhengda
    Wang, Yiqun
    Wang, Ying
    Xiao, Jun
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5593 - 5601
  • [13] Sketch-Based Shape Retrieval via Multi-view Attention and Generalized Similarity
    Xu, Yongzhe
    Hu, Jiangchuan
    Zeng, Kun
    Gong, YongYi
    2018 7TH INTERNATIONAL CONFERENCE ON DIGITAL HOME (ICDH 2018), 2018, : 311 - 317
  • [14] Robust Face Alignment by Multi-Order High-Precision Hourglass Network
    Wan, Jun
    Lai, Zhihui
    Liu, Jun
    Zhou, Jie
    Gao, Can
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 121 - 133
  • [15] Robust multi-view learning via M-estimator joint sparse representation
    Hu, Yutao
    Wang, Yulong
    Li, Han
    Chen, Hong
    PATTERN RECOGNITION, 2024, 151
  • [16] High-precision bicycle detection on single side-view image based on the geometric relationship
    Lin, Yen-Bor
    Young, Chung-Ping
    PATTERN RECOGNITION, 2017, 63 : 334 - 354
  • [17] Towards high-resolution large-scale multi-view stereo
    Hiep, Vu Hoang
    Keriven, Renaud
    Labatut, Patrick
    Pons, Jean-Philippe
    CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 1430 - 1437
  • [18] Markerless Multi-view Human Motion Tracking Using Manifold Model Learning by Charting
    Saini, Sanjay
    Rambli, Dayang Rohaya Bt Awang
    Sulaiman, Suziah Bt
    Zakaria, M. Nordin B.
    Rohkmah, Siti
    INTERNATIONAL SYMPOSIUM ON ROBOTICS AND INTELLIGENT SENSORS 2012 (IRIS 2012), 2012, 41 : 664 - 670
  • [19] Multi-view human movement recognition based on fuzzy distances and linear discriminant analysis
    Iosifidis, Alexandros
    Tefas, Anastasios
    Nikolaidis, Nikolaos
    Pitas, Ioannis
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2012, 116 (03) : 347 - 360
  • [20] Pixel2Mesh++: Multi-View 3D Mesh Generation via Deformation
    Wen, Chao
    Zhang, Yinda
    Li, Zhuwen
    Fu, Yanwei
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1042 - 1051