Edge Deployment Framework of GuardBot for Optimized Face Mask Recognition With Real-Time Inference Using Deep Learning

被引:11
作者
Manzoor, Sumaira [1 ]
Kim, Eun-Jin [2 ]
Joo, Sung-Hyeon [2 ]
Bae, Sang-Hyeon [2 ]
In, Gun-Gyo [2 ]
Joo, Kyeong-Jin [2 ]
Choi, Jun-Hyeon [2 ]
Kuc, Tae-Yong [2 ]
机构
[1] Creat Algorithms & Sensor Evolut Lab, Suwon 16419, South Korea
[2] Sungkyunkwan Univ, Coll Informat & Commun Engn, Dept Elect & Comp Engn, Suwon 16419, South Korea
关键词
Face recognition; Computational modeling; Robots; Optimization; Convolutional neural networks; Deep learning; COVID-19; Face detection; face mask classification; voice alert; service robot; deep learning; transfer learning; Jetson Xavier NX; TensorFlow TensorRT; optimization; inference; CHALLENGES; SECURITY;
D O I
10.1109/ACCESS.2022.3190538
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning based models on the edge devices have received considerable attention as a promising means to handle a variety of AI applications. However, deploying the deep learning models in the production environment with efficient inference on the edge devices is still a challenging task due to computation and memory constraints. This paper proposes a framework for the service robot named GuardBot powered by Jetson Xavier NX and presents a real-world case study of deploying the optimized face mask recognition application with real-time inference on the edge device. It assists the robot to detect whether people are wearing a mask to guard against COVID-19 and gives a polite voice reminder to wear the mask. Our framework contains dual-stage architecture based on convolutional neural networks with three main modules that employ (1) MTCNN for face detection, (2) our proposed CNN model and seven transfer learning based custom models which are Inception-v3, VGG16, denseNet121, resNet50, NASNetMobile, XceptionNet, MobileNet-v2 for face mask classification, (3) TensorRT for optimization of all the models to speedup inference on the Jetson Xavier NX. Our study carries out several analysis based on the models' performance in terms of their frames per second, execution time and images per second. It also evaluates the accuracy, precision, recall & F1-score and makes the comparison of all models before and after optimization with a main focus on high throughput and low latency. Finally, the framework is deployed on a mobile robot to perform experiments in both outdoor and multi-floor indoor environments with patrolling and non-patrolling modes. Compared to other state-of-the-art models, our proposed CNN model for face mask recognition based on the classification obtains 94.5%, 95.9% and 94.28% accuracy on training, validation and testing datasets respectively which is better than MobileNet-v2, Xception and InceptionNet-v3 while it achieves highest throughput and lowest latency than all other models after optimization at different precision levels.
引用
收藏
页码:77898 / 77921
页数:24
相关论文
共 117 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] Alzantot Moustafa, 2017, MobiSys, V2017, P7, DOI 10.1145/3089801.3089805
  • [3] [Anonymous], How COVID-19 Spreads
  • [4] [Anonymous], 2022, ROBOT REMINDS JAPAN
  • [5] [Anonymous], 2017, NIPS
  • [6] [Anonymous], 2020, MASK USE CONTEXT COV
  • [7] Birje Mahantesh N., 2017, International Journal of Cloud Computing, V6, P32
  • [8] Edge Computing in IoT-Based Manufacturing
    Chen, Baotong
    Wan, Jiafu
    Celesti, Antonio
    Li, Di
    Abbas, Haider
    Zhang, Qin
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2018, 56 (09) : 103 - 109
  • [9] Deep Learning With Edge Computing: A Review
    Chen, Jiasi
    Ran, Xukan
    [J]. PROCEEDINGS OF THE IEEE, 2019, 107 (08) : 1655 - 1674
  • [10] Chen TQ, 2018, PROCEEDINGS OF THE 13TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P579