Real-Time Target Detection Method Based on Lightweight Convolutional Neural Network

Cited by: 71
Authors
Yun, Juntong [1, 2]
Jiang, Du [1, 3, 4]
Liu, Ying [2, 4]
Sun, Ying [1, 3, 4]
Tao, Bo [1, 3, 4]
Kong, Jianyi [2, 3, 4]
Tian, Jinrong [1, 2]
Tong, Xiliang [2, 4]
Xu, Manman [1, 2, 3]
Fang, Zifan [5]
Affiliations
[1] Wuhan Univ Sci & Technol, Key Lab Met Equipment & Control Technol, Minist Educ, Wuhan, Peoples R China
[2] Wuhan Univ Sci & Technol, Res Ctr Biomimet Robot & Intelligent Measurement &, Wuhan, Peoples R China
[3] Wuhan Univ Sci & Technol, Hubei Key Lab Mech Transmiss & Mfg Engn, Wuhan, Peoples R China
[4] Wuhan Univ Sci & Technol, Precis Mfg Res Inst, Wuhan, Peoples R China
[5] China Three Gorges Univ, Hubei Key Lab Hydroelect Machinery Design & Mainte, Yichang, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Deep learning; target detection; MobileNets-SSD; depthwise separable convolution; residual module; gesture recognition
DOI
10.3389/fbioe.2022.861286
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification code
071005; 0836; 090102; 100705
Abstract
Continued progress in deep learning keeps improving target detection technology. Current research focuses on improving detection accuracy, which has made target detection models too large. The number of parameters and the detection speed of a model are very important for the practical application of target detection in embedded systems. This article proposes a real-time target detection method based on a lightweight convolutional neural network to reduce the number of model parameters and improve detection speed. A depthwise separable residual module is constructed by combining depthwise separable convolution with a non-bottleneck-free residual module, and this module, together with a depthwise separable convolution structure, replaces the VGG backbone network in the SSD network for feature extraction, reducing the parameter count and improving detection speed. At the same time, the standard 3 x 3 convolution is replaced with 1 x 3 and 3 x 1 convolution kernels to obtain the multiple detection feature maps corresponding to SSD, and the real-time target detection model based on a lightweight convolutional neural network is established by integrating the information from these feature maps. Comparative experiments on a self-built target detection dataset of complex scenes verify the effectiveness and superiority of the proposed method. The model is also tested on video to verify its real-time performance and deployed on the Android platform to verify its scalability.
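As a rough illustration of why the substitutions described in the abstract shrink a model, the sketch below counts convolution weights (biases ignored) for a standard 3 x 3 layer, a depthwise separable layer, and a 1 x 3 followed by 3 x 1 pair. The function names, the 256-channel example, and the assumption that the 1 x 3 layer already outputs the final channel count are illustrative choices, not details taken from the article.

```python
def standard_conv_params(c_in, c_out, kh=3, kw=3):
    """Weights in a standard kh x kw convolution (bias ignored)."""
    return kh * kw * c_in * c_out


def depthwise_separable_params(c_in, c_out, kh=3, kw=3):
    """One kh x kw filter per input channel, then a 1 x 1 pointwise conv."""
    depthwise = kh * kw * c_in
    pointwise = c_in * c_out
    return depthwise + pointwise


def factorized_conv_params(c_in, c_out, k=3):
    """A 1 x k conv followed by a k x 1 conv.

    Assumption: the intermediate tensor has c_out channels.
    """
    first = k * c_in * c_out   # 1 x k layer
    second = k * c_out * c_out  # k x 1 layer
    return first + second


if __name__ == "__main__":
    c_in = c_out = 256  # hypothetical layer width
    print(standard_conv_params(c_in, c_out))        # 589824
    print(depthwise_separable_params(c_in, c_out))  # 67840
    print(factorized_conv_params(c_in, c_out))      # 393216
```

For this hypothetical 256-channel layer, the depthwise separable form uses roughly 11.5% of the weights of the standard convolution, and the 1 x 3 plus 3 x 1 pair uses about two thirds.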
Pages: 13