Enhancing 3D human pose estimation with NIR single-pixel imaging and time-of-flight technology: a deep learning approach

被引:0
作者
Quero, Carlos Osorio [1 ]
Durini, Daniel [1 ]
Rangel-Magdaleno, Jose
Martinez-Carranza, Jose [2 ]
Ramos-Garcia, Ruben [3 ]
机构
[1] Inst Nacl Astrofis Opt & Electr, Elect Dept, Digital Syst Grp, Puebla 72840, Mexico
[2] Inst Nacl Astrofis Opt & Electr, Comp Sci Dept, Puebla 72810, Mexico
[3] Inst Nacl Astrofis Opt & Electr, Opt Dept, Puebla 72810, Mexico
关键词
GAIT RECOGNITION; IMAGES; HAND; MESH;
D O I
10.1364/JOSAA.499933
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
The extraction of 3D human pose and body shape details from a single monocular image is a significant challenge in computer vision. Traditional methods use RGB images, but these are constrained by varying lighting and occlusions. However, cutting -edge developments in imaging technologies have introduced new techniques such as single -pixel imaging (SPI) that can surmount these hurdles. In the near -infrared spectrum, SPI demonstrates impressive capabilities in capturing a 3D human pose. This wavelength can penetrate clothing and is less influenced by lighting variations than visible light, thus providing a reliable means to accurately capture body shape and pose data, even in difficult settings. In this work, we explore the use of an SPI camera operating in the NIR with time -of -flight (TOF) at bands 850-1550 nm as a solution to detect humans in nighttime environments. The proposed system uses the vision transformers (ViT) model to detect and extract the characteristic features of humans for integration over a 3D body model SMPL-X through 3D body shape regression using deep learning. To evaluate the efficacy of NIR-SPI 3D image reconstruction, we constructed a laboratory scenario that simulates nighttime conditions, enabling us to test the feasibility of employing NIR-SPI as a vision sensor in outdoor environments. By assessing the results obtained from this setup, we aim to demonstrate the potential of NIR-SPI as an effective tool to detect humans in nighttime scenarios and capture their accurate 3D body pose and shape. (c) 2024 Optica Publishing Group
引用
收藏
页码:414 / 423
页数:10
相关论文
共 85 条
  • [1] SCAPE: Shape Completion and Animation of People
    Anguelov, D
    Srinivasan, P
    Koller, D
    Thrun, S
    Rodgers, J
    Davis, J
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (03): : 408 - 416
  • [2] Bañuls A, 2020, IEEE INT SYMP SAFE, P380, DOI [10.1109/ssrr50563.2020.9292593, 10.1109/SSRR50563.2020.9292593]
  • [3] Bao Wenxia, 2023, 2023 2nd International Conference on Big Data, Information and Computer Network (BDICN), P264, DOI 10.1109/BDICN58493.2023.00061
  • [4] FaceWarehouse: A 3D Facial Expression Database for Visual Computing
    Cao, Chen
    Weng, Yanlin
    Zhou, Shun
    Tong, Yiying
    Zhou, Kun
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2014, 20 (03) : 413 - 425
  • [5] A 2D Markerless Gait Analysis Methodology: Validation on Healthy Subjects
    Castelli, Andrea
    Paolini, Gabriele
    Cereatti, Andrea
    Della Croce, Ugo
    [J]. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2015, 2015
  • [6] 3D human body reconstruction based on SMPL model
    Chen, Dongyue
    Song, Yuanyuan
    Liang, Fangzheng
    Ma, Teng
    Zhu, Xiaoming
    Jia, Tong
    [J]. VISUAL COMPUTER, 2023, 39 (05) : 1893 - 1906
  • [7] Cholesky Factorization on Heterogeneous CPU and GPU Systems
    Chen, Jieyang
    Chen, Zizhong
    [J]. 2015 NINTH INTERNATIONAL CONFERENCE ON FRONTIER OF COMPUTER SCIENCE AND TECHNOLOGY FCST 2015, 2015, : 19 - 26
  • [8] Personnel Recognition and Gait Classification Based on Multistatic Micro-Doppler Signatures Using Deep Convolutional Neural Networks
    Chen, Zhaoxi
    Li, Gang
    Fioranelli, Francesco
    Griffiths, Hugh
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2018, 15 (05) : 669 - 673
  • [9] Efficient Use of GPU Memory for Large-Scale Deep Learning Model Training
    Choi, Hyeonseong
    Lee, Jaehwan
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (21):
  • [10] Accurate 3D Body Shape Regression using Metric and Semantic Attributes
    Choutas, Vasileios
    Mueller, Lea
    Huang, Chun-Hao P.
    Tang, Siyu
    Tzionas, Dimitrios
    Black, Michael J.
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2708 - 2718