Object Detection Enhancement Algorithm Based on Curriculum Learning

Cited by: 0

Authors:

Dai L. [1]

Huang S. [1]

Affiliations:

[1] College of Electrical Engineering, Sichuan University, Chengdu

Source:

2021 / Institute of Computing Technology / Issue 33

Keywords:

Curriculum learning; Feature extraction; Object detection

DOI:

10.3724/SP.J.1089.2021.18401

Abstract:

The performance of object detection algorithms depends on both the dataset distribution and the design of the feature extraction network. Starting from these two points, we first explore the potential underlying causes of low detection accuracy for small objects by analyzing the distribution of object attributes at various scales in the COCO 2017 dataset. We accordingly propose a copy and paste (CP) module, which adjusts the distribution of small objects offline in two ways: upsampling the images that contain small objects, and copying and pasting the small objects within those images. Then, to further improve the network's feature extraction ability and inspired by the idea of curriculum learning (CL), we propose a CL layer, which uses ground truth labels to guide the learning process, together with a CL factor that controls the learning intensity, so that the features of objects are enhanced to facilitate feature extraction. We deploy the CP module on the COCO 2017 dataset and embed the CL layer in the CenterNet network to conduct multiple sets of comparative experiments, using average detection accuracy and the detection accuracy for small, medium, and large objects as evaluation metrics. The experimental results demonstrate the effectiveness of the CP module and the CL layer. © 2021, Beijing China Science Journal Publishing Co. Ltd. All rights reserved.
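
The offline copy-and-paste idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`is_small`, `copy_paste_small`, `oversample`), the list-of-rows grayscale image, the rule that pasted copies must not overlap existing boxes, and the reading of "upsampling" as repeating images in the dataset are all assumptions made for illustration. The 32 x 32 pixel area threshold for "small" follows the COCO evaluation convention.

```python
import random

SMALL_AREA = 32 * 32  # COCO convention: area below 32x32 pixels counts as "small"


def is_small(box):
    """box is (x, y, w, h) in pixels, COCO style."""
    _, _, w, h = box
    return w * h < SMALL_AREA


def overlaps(a, b):
    """True if two (x, y, w, h) boxes share any pixel."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    return ax < bx + bw and bx < ax + aw and ay < by + bh and by < ay + ah


def copy_paste_small(image, boxes, copies=1, tries=20, rng=random):
    """Paste each small object `copies` times at a random free location.

    `image` is a list of rows of pixel values (hypothetical format).
    Returns a new image and the extended box list; the input is not modified.
    """
    height, width = len(image), len(image[0])
    out = [row[:] for row in image]
    new_boxes = list(boxes)
    for box in boxes:
        if not is_small(box):
            continue
        x, y, w, h = box
        patch = [row[x:x + w] for row in image[y:y + h]]
        for _ in range(copies):
            for _ in range(tries):
                nx = rng.randrange(0, width - w + 1)
                ny = rng.randrange(0, height - h + 1)
                candidate = (nx, ny, w, h)
                # Assumed rule: skip locations that overlap any existing box.
                if any(overlaps(candidate, other) for other in new_boxes):
                    continue
                for dy in range(h):
                    out[ny + dy][nx:nx + w] = patch[dy]
                new_boxes.append(candidate)
                break
    return out, new_boxes


def oversample(dataset, factor=2):
    """Repeat every (image, boxes) sample that contains a small object."""
    result = []
    for image, boxes in dataset:
        repeat = factor if any(is_small(b) for b in boxes) else 1
        result.extend([(image, boxes)] * repeat)
    return result
```

Because both steps run before training, the detector itself is unchanged; only the share of small-object instances in the training data shifts. The CL layer described in the abstract is not covered by this sketch.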

Pages: 278-286

Page count: 8
