Low-light aware framework for human activity recognition via optimized dual stream parallel network

被引：23

作者：

Hussain, Altaf ^{[1
]}

Khan, Samee Ullah ^{[1
]}

Khan, Noman ^{[1
]}

Rida, Imad ^{[2
]}

Alharbi, Meshal ^{[3
]}

Baik, Sung Wook ^{[1
]}

机构：

[1] Sejong Univ, Seoul 143747, South Korea

[2] Univ Technol Compiegne, Ctr Rech Royallieu, Lab Biomecan & Bioingn, Compiegne, France

[3] Prince Sattam Bin Abdulaziz Univ, Coll Comp Engn & Sci, Dept Comp Sci, Alkharj 11942, Saudi Arabia

来源：

ALEXANDRIA ENGINEERING JOURNAL | 2023年 / 74卷

关键词：

Human Activity Recogni-tion; Internet of Things; Personalized Communica-tion; Convolutional Neural Net-work; Dual Stream Network; Deep Learning; LSTM; SURVEILLANCE; ATTENTION; FEATURES; FLOW; CNN;

D O I：

10.1016/j.aej.2023.05.050

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Human Activity Recognition (HAR) plays a crucial role in communication and the Internet of Things (IoT), by enabling vision sensors to understand and respond to human behavior more intelligently and efficiently. Existing deep learning models are complex to deal with the low illumination, diverse viewpoints, and cluttered backgrounds, which require substantial computing resources and are not appropriate for edge devices. Furthermore, without an effective video analysis technique it processes entire frames, resulting inadequate performance. To address these key challenges, a cloud-assisted IoT computing framework is proposed for HAR in uncertain low-lighting environments, which is mainly composed of two tiers: edge and cloud computing. Initially, a lightweight Convolutional Neural Network (CNN) model is developed which is responsible to enhance the low-light frames, followed by the human detection algorithm to process the selective frames, thus enabling efficient resource utilization. Next, these refined frames are then transmitted to the cloud for accurate HAR, where dual stream CNN and transformer fusion network extract both short-and long-range spatiotemporal discriminative features followed by proposed Optimized Parallel Sequential Temporal Network (OPSTN) with squeeze and excitation attention to efficiently learn HAR in complex scenarios. Finally, extensive experiments are conducted over three challenging HAR datasets to deeply examine the proposed framework from various perspectives such as complex activity recognition, lowlighting, etc., where the results are outperformed compared with the state-of-art methods. & COPY; 2023 THE AUTHORS. Published by Elsevier BV on behalf of Faculty of Engineering, Alexandria University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/ licenses/by-nc-nd/4.0/).

引用

页码：569 / 583

页数：15

共 63 条

[1]

Abdulmunem A, 2016, INT C PATT RECOG, P805, DOI 10.1109/ICPR.2016.7899734

[2] The computation of optical flow [J].