Low-light aware framework for human activity recognition via optimized dual stream parallel network

被引:23
作者
Hussain, Altaf [1 ]
Khan, Samee Ullah [1 ]
Khan, Noman [1 ]
Rida, Imad [2 ]
Alharbi, Meshal [3 ]
Baik, Sung Wook [1 ]
机构
[1] Sejong Univ, Seoul 143747, South Korea
[2] Univ Technol Compiegne, Ctr Rech Royallieu, Lab Biomecan & Bioingn, Compiegne, France
[3] Prince Sattam Bin Abdulaziz Univ, Coll Comp Engn & Sci, Dept Comp Sci, Alkharj 11942, Saudi Arabia
关键词
Human Activity Recogni-tion; Internet of Things; Personalized Communica-tion; Convolutional Neural Net-work; Dual Stream Network; Deep Learning; LSTM; SURVEILLANCE; ATTENTION; FEATURES; FLOW; CNN;
D O I
10.1016/j.aej.2023.05.050
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Human Activity Recognition (HAR) plays a crucial role in communication and the Internet of Things (IoT), by enabling vision sensors to understand and respond to human behavior more intelligently and efficiently. Existing deep learning models are complex to deal with the low illumination, diverse viewpoints, and cluttered backgrounds, which require substantial computing resources and are not appropriate for edge devices. Furthermore, without an effective video analysis technique it processes entire frames, resulting inadequate performance. To address these key challenges, a cloud-assisted IoT computing framework is proposed for HAR in uncertain low-lighting environments, which is mainly composed of two tiers: edge and cloud computing. Initially, a lightweight Convolutional Neural Network (CNN) model is developed which is responsible to enhance the low-light frames, followed by the human detection algorithm to process the selective frames, thus enabling efficient resource utilization. Next, these refined frames are then transmitted to the cloud for accurate HAR, where dual stream CNN and transformer fusion network extract both short-and long-range spatiotemporal discriminative features followed by proposed Optimized Parallel Sequential Temporal Network (OPSTN) with squeeze and excitation attention to efficiently learn HAR in complex scenarios. Finally, extensive experiments are conducted over three challenging HAR datasets to deeply examine the proposed framework from various perspectives such as complex activity recognition, lowlighting, etc., where the results are outperformed compared with the state-of-art methods. & COPY; 2023 THE AUTHORS. Published by Elsevier BV on behalf of Faculty of Engineering, Alexandria University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/ licenses/by-nc-nd/4.0/).
引用
收藏
页码:569 / 583
页数:15
相关论文
共 63 条
[1]  
Abdulmunem A, 2016, INT C PATT RECOG, P805, DOI 10.1109/ICPR.2016.7899734
[2]   The computation of optical flow [J].
Beauchemin, SS ;
Barron, JL .
ACM COMPUTING SURVEYS, 1995, 27 (03) :433-467
[3]   Activity Recognition based on a Magnitude-Orientation Stream Network [J].
Caetano, Carlos ;
de Melo, Victor H. C. ;
dos Santos, Jefersson A. ;
Schwartz, William Robson .
2017 30TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI), 2017, :47-54
[4]   Learning a Deep Single Image Contrast Enhancer from Multi-Exposure Images [J].
Cai, Jianrui ;
Gu, Shuhang ;
Zhang, Lei .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (04) :2049-2062
[5]   Multi-View Super Vector for Action Recognition [J].
Cai, Zhuowei ;
Wang, Limin ;
Peng, Xiaojiang ;
Qiao, Yu .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :596-603
[6]   Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].
Carreira, Joao ;
Zisserman, Andrew .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733
[7]  
Chen ZX, 2020, Arxiv, DOI arXiv:1811.07059
[8]   Exact histogram specification [J].
Coltuc, D ;
Bolon, P ;
Chassery, JM .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (05) :1143-1152
[9]   ConViT: improving vision transformers with soft convolutional inductive biases [J].
d'Ascoli, Stephane ;
Touvron, Hugo ;
Leavitt, Matthew L. ;
Morcos, Ari S. ;
Biroli, Giulio ;
Sagun, Levent .
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2022, 2022 (11)
[10]   Human action recognition using two-stream attention based LSTM networks [J].
Dai, Cheng ;
Liu, Xingang ;
Lai, Jinfeng .
APPLIED SOFT COMPUTING, 2020, 86