Micro-network-based deep convolutional neural network for human activity recognition from realistic and multi-view visual data

被引：0

作者：

Arati Kushwaha

Ashish Khare

Om Prakash

机构：

[1] University of Allahabad,Department of Electronics & Communication

[2] HNB Garhwal University,Department of Computer Science & Engineering

来源：

Neural Computing and Applications | 2023年 / 35卷

关键词：

Convolutional neural network; Human activity recognition; Micro-network; Softmax classifier;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In the recent past, deep convolutional neural network (DCNN) has been used in majority of state-of-the-art methods due to its remarkable performance in number of computer vision applications. However, DCNN are computationally expensive and requires more resources as well as computational time. Also, deeper architectures are prone to overfitting problem, while small-size dataset is used. To address these limitations, we propose a simple and computationally efficient deep convolutional neural network (DCNN) architecture based on the concept multiscale processing for human activity recognition. We increased the width and depth of the network by carefully crafting the design of network, which results in improved utilization of computational resources. First, we designed a small micro-network with varying receptive field size convolutional kernels (1×\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times$$\end{document}1, 3×\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times$$\end{document}3, and 5×\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times$$\end{document}5) for extraction of unique discriminative information of human objects having variations in object size, pose, orientation, and view. Then, the proposed DCNN architecture is designed by stacking repeated building blocks of small micro-networks with same topology. Here, we factorize the larger convolutional operation in stack of smaller convolutional operations to make the network computationally efficient. The softmax classifier is used for activity classification. Advantage of the proposed architecture over standard deep architectures is its computational efficiency and flexibility to use with both small as well as large size datasets. To evaluate the effectiveness of the proposed architecture, several extensive experiments are conducted by using publically available datasets, namely UCF sports, IXMAS, YouTube, TV-HI, HMDB51, and UCF101 datasets. The activity recognition results have shown outperformance of the proposed method over other existing state-of-the-art methods.

引用

页码：13321 / 13341

页数：20

共 50 条

[31] A multi-view convolutional neural network based on cross-connection and residual-wider
Wenhua Chen
Wenguang Zhang
Wei Wang
[J]. Applied Intelligence, 2023, 53 : 14316 - 14328
[32] Wearable Sensors-Based Human Activity Recognition with Deep Convolutional Neural Network and Fuzzy Classification
Serpush, Fatemeh
Menhaj, Mohammad Bagher
Masoumi, Behrooz
Karasfi, Babak
[J]. WIRELESS PERSONAL COMMUNICATIONS, 2023, 133 (02) : 889 - 911
[33] Wearable Sensors-Based Human Activity Recognition with Deep Convolutional Neural Network and Fuzzy Classification
Fatemeh Serpush
Mohammad Bagher Menhaj
Behrooz Masoumi
Babak Karasfi
[J]. Wireless Personal Communications, 2023, 133 : 889 - 911
[34] Real Time Human Activity Recognition Using Convolutional Neural Network and Deep Gated Recurrent Unit
Fajar, Rasyid
Suciati, Nanik
Navastara, Dini Adni
[J]. 2020 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICELTICS 2020), 2020, : 58 - 63
[35] Multi-model weighted voting method based on convolutional neural network for human activity recognition
Ouyang, Kangyue
Pan, Zhongliang
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (29) : 73305 - 73328
[36] Multi-View Deep Network: A Deep Model Based on Learning Features From Heterogeneous Neural Networks for Sentiment Analysis
Sadr, Hossein
Pedram, Mir Mohsen
Teshnehlab, Mohammad
[J]. IEEE ACCESS, 2020, 8 : 86984 - 86997
[37] Recognition of emotion in music based on deep convolutional neural network
Rajib Sarkar
Sombuddha Choudhury
Saikat Dutta
Aneek Roy
Sanjoy Kumar Saha
[J]. Multimedia Tools and Applications, 2020, 79 : 765 - 783
[38] Traffic Sign Recognition Based on Deep Convolutional Neural Network
Yin, Shihao
Deng, Jicai
Zhang, Dawei
Du, Jingyuan
[J]. COMPUTER VISION, PT I, 2017, 771 : 685 - 695
[39] Recognition of emotion in music based on deep convolutional neural network
Sarkar, Rajib
Choudhury, Sombuddha
Dutta, Saikat
Roy, Aneek
Saha, Sanjoy Kumar
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (1-2) : 765 - 783
[40] Vehicle category recognition based on deep convolutional neural network
Yuan G.-P.
Tang Y.-P.
Han W.-M.
Chen Q.
[J]. Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2018, 52 (04): : 694 - 702

← 1 2 3 4 5 →