Improved YOLOX object detection algorithm based on gradient difference adaptive learning rate optimization

被引:0
|
作者
Song Y. [1 ]
Ge Q. [2 ,3 ,4 ]
Zhu J. [5 ]
Lu Z. [1 ]
机构
[1] School of Artificial Intelligence, School of Future Technology, Nanjing University of Information Science and Technology, Nanjing
[2] School of Automation, Nanjing University of Information Science and Technology, Nanjing
[3] Jiangsu Collaborative Innovation Center of Atmospheric Environment and Equipment Technology (CICAEET), Nanjing University of Information Science and Technology, Nanjing
[4] Jiangsu Province Engineering Research Center of Intelligent Meteorological Exploration Robot(C⁃IMER), Nanjing University of Information Science and Technology, Nanjing
[5] College of Information Engineering, Henan University of Science and Technology, Luoyang
基金
中国国家自然科学基金;
关键词
neural network optimization; object detection; PASCAL VOC; RSOD; YOLOX;
D O I
10.7527/S1000-6893.2022.27951
中图分类号
学科分类号
摘要
Object detection has always been one of the most challenging problems in the field of computer vision,and is widely used in the tasks such as face recognition,autonomous driving and traffic detection. To further improve the performance of current mainstream object detection algorithms,this paper proposes an improved object detection algorithm based on YOLOX,and carries out experiments on the standard PASCAL VOC 07+12 and RSOD datasets. The YOLOX object detection algorithm is improved mainly through data enhancement,improving network structure and loss function. At the same time,an adaptive learning rate optimization algorithm based on gradient difference is proposed to train the improved YOLOX algorithm,which is also suitable for optimization of other neural networks. Experiments are carried out on PASCAL VOC 07+12 standard data sets. Results show that the AP of the improved YOLOX-S algorithm is increased from 61. 63% to 66. 35% compared with that of the original YOLOX-S algorithm. The improvement effect is obvious. Experiments are also carried out on the RSOD standard data set. The results show that the AP of the improved YOLOX-S algorithm is increased from 69. 4% to 73. 2% on the RSOD data set,compared with those of other mainstream YOLO series algorithms. The improvement effect is also significant. Experiments show effective improvement of YOLOX’s object detection. © 2023 AAAS Press of Chinese Society of Aeronautics and Astronautics. All rights reserved.
引用
收藏
相关论文
共 38 条
  • [1] LI K Q,, CHEN Y,, LIU J C,, Et al., Survey of deep learning-based object detection algorithms[J], Computer Engineering, 48, 7, pp. 1-12, (2022)
  • [2] DARRELL T,, Et al., Rich feature hierarchies for accurate object detection and semantic segmentation, 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp. 580-587, (2014)
  • [3] ZHANG X Y,, REN S Q,, Et al., Spatial pyramid pooling in deep convolutional networks for visual recognition[J], IEEE Transactions on Pattern Analysis and Machine Intelligence, 37, 9, pp. 1904-1916, (2015)
  • [4] GIRSHICK R., Fast R-CNN[C]// 2015 IEEE International Conference on Computer Vision(ICCV), pp. 1440-1448, (2016)
  • [5] Faster R-CNN:Towards real-time object detection with region proposal networks[J], IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 6, pp. 1137-1149, (2017)
  • [6] 2016 IEEE Conference on Computer Vision and Pattern Recognition(CVPR), pp. 779-788, (2016)
  • [7] LIU W,, ANGUELOV D,, ERHAN D,, Et al., SSD:Single shot MultiBox detector, European Conference on Computer Vision, pp. 21-37, (2016)
  • [8] LI H G, YU R N, DING W R., Research development of small object traching based on deep learning[J], Acta Aeronautica et Astronautica Sinica, 42, 7, (2021)
  • [9] 2017 IEEE Conference on Computer Vision and Pattern Recognition(CVPR), pp. 6517-6525, (2017)
  • [10] FARHADI A., YOLOv3:An incremental improvement[DB/OL], (2018)