Inception inspired CNN-GRU hybrid network for human activity recognition

被引:87
作者
Dua, Nidhi [1 ]
Singh, Shiva Nand [1 ]
Semwal, Vijay Bhaskar [2 ]
Challa, Sravan Kumar [1 ]
机构
[1] Natl Inst Technol Jamshedpur, Dept ECE, Jamshedpur, Jharkhand, India
[2] Maulana Azad Natl Inst Technol, Dept CSE, Bhopal, MP, India
关键词
Convolutional neural network; HAR; Inception; Gated recurrent unit; Wearable sensors; Human-computer interaction; Pattern recognition; TIME-SERIES; SYSTEM; MODEL;
D O I
10.1007/s11042-021-11885-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human Activity Recognition (HAR) involves the recognition of human activities using sensor data. Most of the techniques for HAR involve hand-crafted features and hence demand a good amount of human intervention. Moreover, the activity data obtained from sensors are highly imbalanced and hence demand a robust classifier design. In this paper, a novel classifier "ICGNet" is proposed for HAR, which is a hybrid of Convolutional Neural Network (CNN) and Gated Recurrent Unit (GRU). The CNN block used in the proposed network derives its inspiration from the famous Inception module. It uses multiple-sized convolutional filters simultaneously over the input and thus can capture the information in the data at multiple scales. These multi-sized filters introduced at the same level in the convolution network helps to compute more abstract features for local patches of data. It also makes use of 1 x 1 convolution to pool the input across channel dimension, and the intuition behind it is that it helps the model extract the valuable information hidden across the channels. The proposed ICGNet leverages the strengths of CNN and GRU and hence can capture local features and long-term dependencies in the multivariate time series data. It is an end-to-end model for HAR that can process raw data captured from wearable sensors without using any manual feature engineering. Integrating the adaptive user interfaces, the proposed HAR system can be applied to Human-Computer Interaction (HCI) fields such as interactive games, robot learning, health monitoring, and pattern-based surveillance. The overall accuracies achieved on two benchmark datasets viz. MHEALTH and PAMAP2 are 99.25% and 97.64%, respectively. The results indicate that the proposed network outperformed the similar architectures proposed for HAR in the literature.
引用
收藏
页码:5369 / 5403
页数:35
相关论文
共 71 条
[1]  
Ahad A R., 2020, Intelligent systems reference library, P13, DOI DOI 10.1007/978-3-030-51379-5_2
[2]  
Anguita Davide., 2013, ESANN, V3, page, P3
[3]  
[Anonymous], 2019, ARXIV190503809
[4]   Activity Recognition and Abnormal Behaviour Detection with Recurrent Neural Networks [J].
Arifoglu, Damla ;
Bouchachia, Abdelhamid .
14TH INTERNATIONAL CONFERENCE ON MOBILE SYSTEMS AND PERVASIVE COMPUTING (MOBISPC 2017) / 12TH INTERNATIONAL CONFERENCE ON FUTURE NETWORKS AND COMMUNICATIONS (FNC 2017) / AFFILIATED WORKSHOPS, 2017, 110 :86-93
[5]   Landmark-based multimodal human action recognition [J].
Asteriadis, Stylianos ;
Daras, Petros .
MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (03) :4505-4521
[6]   mHealthDroid: A novel framework for agile development of mobile health applications [J].
Banos, Oresti ;
Garcia, Rafael ;
Holgado-Terriza, Juan A. ;
Damas, Miguel ;
Pomares, Hector ;
Rojas, Ignacio ;
Saez, Alejandro ;
Villalonga, Claudia .
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8868 :91-98
[7]   Vision-based human activity recognition: a survey [J].
Beddiar, Djamila Romaissa ;
Nini, Brahim ;
Sabokrou, Mohammad ;
Hadid, Abdenour .
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (41-42) :30509-30555
[8]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[9]   On the use of ensemble of classifiers for accelerometer-based activity recognition [J].
Catal, Cagatay ;
Tufekci, Selin ;
Pirmit, Elif ;
Kocabag, Guner .
APPLIED SOFT COMPUTING, 2015, 37 :1018-1022
[10]   Deep Learning for Sensor-based Human Activity Recognition: Overview, Challenges, and Opportunities [J].
Chen, Kaixuan ;
Zhang, Dalin ;
Yao, Lina ;
Guo, Bin ;
Yu, Zhiwen ;
Liu, Yunhao .
ACM COMPUTING SURVEYS, 2021, 54 (04)