Real-Time Target Detection Method Based on Lightweight Convolutional Neural Network

被引:71
作者
Yun, Juntong [1 ,2 ]
Jiang, Du [1 ,3 ,4 ]
Liu, Ying [2 ,4 ]
Sun, Ying [1 ,3 ,4 ]
Tao, Bo [1 ,3 ,4 ]
Kong, Jianyi [2 ,3 ,4 ]
Tian, Jinrong [1 ,2 ]
Tong, Xiliang [2 ,4 ]
Xu, Manman [1 ,2 ,3 ]
Fang, Zifan [5 ]
机构
[1] Wuhan Univ Sci & Technol, Key Lab Met Equipment & Control Technol, Minist Educ, Wuhan, Peoples R China
[2] Wuhan Univ Sci & Technol, Res Ctr Biomimet Robot & Intelligent Measurement &, Wuhan, Peoples R China
[3] Wuhan Univ Sci & Technol, Hubei Key Lab Mech Transmiss & Mfg Engn, Wuhan, Peoples R China
[4] Wuhan Univ Sci & Technol, Precis Mfg Res Inst, Wuhan, Peoples R China
[5] China Three Gorges Univ, Hubei Key Lab Hydroelect Machinery Design & Mainte, Yichang, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; target detection; MobileNets-SSD; depthwise separable convolution; residual module; GESTURE RECOGNITION;
D O I
10.3389/fbioe.2022.861286
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The continuous development of deep learning improves target detection technology day by day. The current research focuses on improving the accuracy of target detection technology, resulting in the target detection model being too large. The number of parameters and detection speed of the target detection model are very important for the practical application of target detection technology in embedded systems. This article proposed a real-time target detection method based on a lightweight convolutional neural network to reduce the number of model parameters and improve the detection speed. In this article, the depthwise separable residual module is constructed by combining depthwise separable convolution and non-bottleneck-free residual module, and the depthwise separable residual module and depthwise separable convolution structure are used to replace the VGG backbone network in the SSD network for feature extraction of the target detection model to reduce parameter quantity and improve detection speed. At the same time, the convolution kernels of 1 x 3 and 3 x 1 are used to replace the standard convolution of 3 x 3 by adding the convolution kernels of 1 x 3 and 3 x 1, respectively, to obtain multiple detection feature graphs corresponding to SSD, and the real-time target detection model based on a lightweight convolutional neural network is established by integrating the information of multiple detection feature graphs. This article used the self-built target detection dataset in complex scenes for comparative experiments; the experimental results verify the effectiveness and superiority of the proposed method. The model is tested on video to verify the real-time performance of the model, and the model is deployed on the Android platform to verify the scalability of the model.
引用
收藏
页数:13
相关论文
共 72 条
[41]   Grasping force prediction based on sEMG signals [J].
Ma, Ruyi ;
Zhang, Leilei ;
Li, Gongfa ;
Jiang, Du ;
Xu, Shuang ;
Chen, Disi .
ALEXANDRIA ENGINEERING JOURNAL, 2020, 59 (03) :1135-1147
[42]   Fast-armored target detection based on multi-scale representation and guided anchor [J].
Meng, Fan-jie ;
Wang, Xin-qing ;
Shao, Fa-ming ;
Wang, Dong ;
Hu, Xiao-dong .
DEFENCE TECHNOLOGY, 2020, 16 (04) :922-932
[43]  
Ming-Ming L.I., 2019, MULTITARGET DETECTIO
[44]   A New Image Recognition and Classification Method Combining Transfer Learning Algorithm and MobileNet Model for Welding Defects [J].
Pan, Haihong ;
Pang, Zaijun ;
Wang, Yaowei ;
Wang, Yijue ;
Chen, Lin .
IEEE ACCESS, 2020, 8 :119951-119960
[45]  
[仇荣超 Qiu Rongchao], 2019, [兵工学报, Acta Armamentarii], V40, P1171
[46]   A Review on Human-Computer Interaction and Intelligent Robots [J].
Ren, Fuji ;
Bao, Yanwei .
INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2020, 19 (01) :5-47
[47]   Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks [J].
Ren, Shaoqing ;
He, Kaiming ;
Girshick, Ross ;
Sun, Jian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) :1137-1149
[48]   MobileNetV2: Inverted Residuals and Linear Bottlenecks [J].
Sandler, Mark ;
Howard, Andrew ;
Zhu, Menglong ;
Zhmoginov, Andrey ;
Chen, Liang-Chieh .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4510-4520
[49]   Fully Convolutional Networks for Semantic Segmentation [J].
Shelhamer, Evan ;
Long, Jonathan ;
Darrell, Trevor .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (04) :640-651
[50]   Path Planning Optimization of Intelligent Vehicle Based on Improved Genetic and Ant Colony Hybrid Algorithm [J].
Shi, Kangjing ;
Huang, Li ;
Jiang, Du ;
Sun, Ying ;
Tong, Xiliang ;
Xie, Yuanming ;
Fang, Zifan .
FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2022, 10