An improved small object detection method based on Yolo V3

被引:33
作者
Xianbao, Cheng [1 ,2 ]
Guihua, Qiu [1 ]
Yu, Jiang [1 ]
Zhaomin, Zhu [1 ]
机构
[1] Beibu Gulf Univ, Sch Elect & Informat Engn, Qinzhou 535011, Peoples R China
[2] Univ South Australia, Sch Informat Technol & Math Sci, Adelaide, SA 5095, Australia
关键词
Deep learning; YOLO V3; Sampling; Small object; Feature acquisition; CNN;
D O I
10.1007/s10044-021-00989-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, an improved algorithm based on Yolo V3 is proposed, which can effectively improve the accuracy of small target detection. First of all, the feature map acquisition network is improved. The image double-segmentation and bilinear upsampling network are used to replace the 2-step downsampling convolution network in the original network architecture, and the feature values of large and small objects are amplified. Secondly, a size recognition module is added to the input image to reduce the loss of morpheme features caused by no-feature value filling and enhance the recognition ability of small objects. Thirdly, in order to avoid the gradient fading of the network, the residual network element of the output network layer is added to enhance the feature channel of small object detection. Compared with Yolo V3, our algorithm improves the detection accuracy of small objects from 82.4 to 88.5%, the recall rate from 84.6 to 91.3%, and the average accuracy from 95.5 to 97.3%, respectively.
引用
收藏
页码:1347 / 1355
页数:9
相关论文
共 21 条
[1]  
Bappy JH, 2016, IEEE IMAGE PROC, P3658, DOI 10.1109/ICIP.2016.7533042
[2]  
Chen, 2019, U.S. Patent, Patent No. [10,268,947, 10268947]
[3]  
Eivazi S, 2019, P 11 ACM S EYE TRACK, V40
[4]  
Fengmei C., 2019, IEEE T NETW SCI ENG, V24, P464
[5]  
Fu C. -Y., Dssd: Deconvolutional single shot detector, DOI DOI 10.1109/CVPR.2016.90
[7]  
He K, P IEEE C COMP VIS PA, P770, DOI [DOI 10.1109/CVPR.2016.90, 10.1109/CVPR.2016.90]
[8]  
He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[9]  
Hui S., 2019, COMPUT ENG APPL, V55, P213
[10]  
Jiafeng R., 2019, COMPUTER SYSTEMS APP, V28, P171, DOI DOI 10.15888/J.CNKI.CSA.007184