Enhancing 3D human pose estimation with NIR single-pixel imaging and time-of-flight technology: a deep learning approach

Cited by: 0

Authors:

Quero, Carlos Osorio [1]

Durini, Daniel [1]

Rangel-Magdaleno, Jose

Martinez-Carranza, Jose [2]

Ramos-Garcia, Ruben [3]

Affiliations:

[1] Inst Nacl Astrofis Opt & Electr, Elect Dept, Digital Syst Grp, Puebla 72840, Mexico

[2] Inst Nacl Astrofis Opt & Electr, Comp Sci Dept, Puebla 72810, Mexico

[3] Inst Nacl Astrofis Opt & Electr, Opt Dept, Puebla 72810, Mexico

Source:

JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION | 2024 / Vol. 41 / No. 03

Keywords:

GAIT RECOGNITION; IMAGES; HAND; MESH;

DOI:

10.1364/JOSAA.499933

Chinese Library Classification (CLC) number:

O43 [Optics];

Subject classification code:

070207 ; 0803 ;

Abstract:

The extraction of 3D human pose and body shape details from a single monocular image is a significant challenge in computer vision. Traditional methods use RGB images, but these are constrained by varying lighting and occlusions. However, cutting-edge developments in imaging technologies have introduced new techniques such as single-pixel imaging (SPI) that can surmount these hurdles. In the near-infrared spectrum, SPI demonstrates impressive capabilities in capturing a 3D human pose. This wavelength can penetrate clothing and is less influenced by lighting variations than visible light, thus providing a reliable means to accurately capture body shape and pose data, even in difficult settings. In this work, we explore the use of an SPI camera operating in the NIR with time-of-flight (TOF) at bands 850-1550 nm as a solution to detect humans in nighttime environments. The proposed system uses the vision transformers (ViT) model to detect and extract the characteristic features of humans for integration over a 3D body model SMPL-X through 3D body shape regression using deep learning. To evaluate the efficacy of NIR-SPI 3D image reconstruction, we constructed a laboratory scenario that simulates nighttime conditions, enabling us to test the feasibility of employing NIR-SPI as a vision sensor in outdoor environments. By assessing the results obtained from this setup, we aim to demonstrate the potential of NIR-SPI as an effective tool to detect humans in nighttime scenarios and capture their accurate 3D body pose and shape. (c) 2024 Optica Publishing Group

Pages: 414-423

Page count: 10
