Micro-network Based Convolutional Neural Network with Integration of Multilayer Feature Fusion Strategy for Human Activity Recognition

Cited by: 3
Authors
Kushwaha, Arati [1]
Khare, Manish [2]
Khare, Ashish [1]
Affiliations
[1] Univ Allahabad, Dept Elect & Commun, Allahabad, Uttar Pradesh, India
[2] Dhirubhai Ambani Inst Informat & Commun Technol, Gandhinagar, India
Keywords
Human activity recognition; convolutional neural network; micro-network; multilayer feature fusion strategy; softmax classifier; IMAGE;
DOI
10.1142/S0218213022500452
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Convolutional neural networks (CNNs) have shown remarkable performance in numerous computer vision applications over the years, and many works have applied CNNs to human activity recognition (HAR). However, most deep learning-based architectures require large amounts of training data and substantial computational resources. Therefore, we propose a simple and efficient deep learning model for human activity recognition that works on complex visual data and requires fewer computational resources. In the proposed work, we design a novel CNN architecture by stacking repeated components called micro-networks to incorporate multiscale processing in the network layers. We use a feature fusion strategy to pass the previous layer's abstract complementary information to the next adjacent layer, on the premise that each layer encapsulates specific feature maps. This hidden complementary information can therefore potentially enhance the feature discrimination capacity of the network and aid its training. The proposed architecture is trained entirely from scratch with a stochastic gradient descent (SGD) optimizer at an initial learning rate of 0.05, and a softmax classifier is used for activity recognition. The merit of the proposed method over standard deep learning models is its computational efficiency in terms of learnable parameters and computational resources. The proposed model performs well on both small and large datasets. To validate the proposed method, extensive experiments are conducted on publicly available datasets, namely UCF-101, HMDB-51, YouTube, and IXMAS. The results show that the proposed method outperforms existing state-of-the-art methods.
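The training setup named in the abstract (feature fusion between adjacent layers, a softmax classifier, SGD at an initial learning rate of 0.05) can be illustrated with a toy sketch. This is not the authors' architecture: the function names `fuse`, `softmax`, and `sgd_step` are hypothetical, the feature vectors are made-up numbers, and fusing by concatenation is an assumption, since the abstract does not say how features are combined. Only the optimizer type, the 0.05 learning rate, and the softmax output come from the abstract.

```python
import math


def fuse(prev_features, curr_features):
    """Fuse two adjacent layers' features.

    Assumption: fusion is plain concatenation; the abstract does not
    specify the actual fusion operator.
    """
    return list(prev_features) + list(curr_features)


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sgd_step(weights, features, label, lr=0.05):
    """One SGD update of a linear softmax classifier, in place.

    weights: one weight list per class; features: fused feature vector.
    Returns the cross-entropy loss measured before the update.
    """
    logits = [sum(w * x for w, x in zip(row, features)) for row in weights]
    probs = softmax(logits)
    for k, row in enumerate(weights):
        # Gradient of cross-entropy w.r.t. logit k is (p_k - 1[k == label]).
        error = probs[k] - (1.0 if k == label else 0.0)
        for j, x in enumerate(features):
            row[j] -= lr * error * x
    return -math.log(probs[label])


# Toy usage: 3 activity classes, 2 + 2 fused features (illustrative values).
fused = fuse([1.0, 0.5], [0.2, -0.3])
weights = [[0.0] * len(fused) for _ in range(3)]
losses = [sgd_step(weights, fused, label=1) for _ in range(5)]
```

In this sketch the loss on the single toy sample shrinks with each update, which is all it is meant to show; a real model would apply the same update rule to convolutional weights over mini-batches of video frames.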
Pages: 22