THPoseLite, a Lightweight Neural Network for Detecting Pose in Thermal Images

被引:8
作者
Lupion, Marcos [1 ]
Gonzalez-Ruiz, Vicente [1 ]
Medina-Quero, Javier [2 ]
Sanjuan, Juan F. [1 ]
Ortigosa, Pilar M. [1 ]
机构
[1] Univ Almeria, Dept Informat, CeIA3, Almeria 04120, Spain
[2] Univ Granada, Higher Tech Sch Comp Engn & Telecommun, Dept Comp Engn Automat & Robot, Granada 18071, Spain
关键词
Auto-labeling; edge accelerator; pose estimation; quantization; thermal image (TI); FALL DETECTION;
D O I
10.1109/JIOT.2023.3264215
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, smart environments (SEs) enable the monitoring of people with physical disabilities by incorporating activity recognition. Thermal cameras are being incorporated as they preserve privacy. Some deep learning (DL) solutions use the pose of the users because it removes external noise. Although there are robust DL solutions in the visible spectrum (VS), they fail in the thermal domain. Thus, we propose thermal human pose lite (THPoseLite), a convolutional neural network (CNN) based on MobileNetV2 that extracts pose from thermal images (TIs). In a novel way, an auto-labeling approach has been developed. It includes a background removal using an optical flow estimator. It also integrates Blazepose [a pose estimator for VS images (VSIs)] to obtain the poses in the preprocessed TIs. Results show that the preprocessing increases the percentage of detected poses by Blazepose from 19.55% to 76.85%. This allows the recording of human pose estimation (HPE) data sets in the VS without requiring VS cameras or manually annotating data sets. Furthermore, THPoseLite has been embedded in an Internet of Things (IoT) device incorporating an edge tensor processing unit (TPU) accelerator, which can process TIs recorded at 9 frames per second (FPS) in real time (12.28 FPS). It requires fewer than 6W of energy to run. It has been achieved using model quantization, decreasing the accuracy in estimating the poses by only 1%. The mean-squared error of MobileNetV2 in test images is 35.48, obtaining accurate poses in 21% of the images that Blazepose is not able to detect any pose.
引用
收藏
页码:15060 / 15073
页数:14
相关论文
共 56 条
[1]   2D Human Pose Estimation: New Benchmark and State of the Art Analysis [J].
Andriluka, Mykhaylo ;
Pishchulin, Leonid ;
Gehler, Peter ;
Schiele, Bernt .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3686-3693
[2]  
[Anonymous], 2021, Falls
[3]   Action Recognition From Thermal Videos [J].
Batchuluun, Ganbayar ;
Nguyen, Dat Tien ;
Tuyen Danh Pham ;
Park, Chanhum ;
Park, Kang Ryoung .
IEEE ACCESS, 2019, 7 :103893-103917
[4]  
Bazarevsky V, 2020, Arxiv, DOI [arXiv:2006.10204, DOI 10.48550/ARXIV.2006.10204]
[5]  
Bazarevsky V, 2019, Arxiv, DOI [arXiv:1907.05047, DOI 10.48550/ARXIV.1907.05047]
[6]   An Overview on Edge Computing Research [J].
Cao, Keyan ;
Liu, Yefan ;
Meng, Gongjie ;
Sun, Qimeng .
IEEE ACCESS, 2020, 8 :85714-85728
[7]   IN-BED HUMAN POSE ESTIMATION FROM UNSEEN AND PRIVACY-PRESERVING IMAGE DOMAINS [J].
Cao, Ting ;
Armin, Mohammad Ali ;
Denman, Simon ;
Petersson, Lars ;
Ahmedt-Aristizabal, David .
2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,
[8]  
Cao Z, 2017, Arxiv, DOI arXiv:1611.08050
[9]   OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields [J].
Cao, Zhe ;
Hidalgo, Gines ;
Simon, Tomas ;
Wei, Shih-En ;
Sheikh, Yaser .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) :172-186
[10]   Multi-Person Pose Estimation Using Thermal Images [J].
Chen, I-Chien ;
Wang, Chang-Jen ;
Wen, Chao-Kai ;
Tzou, Shiow-Jyu .
IEEE ACCESS, 2020, 8 :174964-174971