Higher efficient YOLOv7: a one-stage method for non-salient object detection

Cited by: 0
Authors
Chengang Dong
Yuhao Tang
Liyan Zhang
Affiliation
[1] Nanjing University of Aeronautics and Astronautics,
Source
Multimedia Tools and Applications | 2024 / Vol. 83
Keywords
Non-salient; Object Detection; Attention Mechanisms; YOLOv7
DOI
Not available
Chinese Library Classification (CLC) number
Subject classification number
Abstract
Object detection has made remarkable progress in recent years, yet real-time detection of non-salient objects remains a challenging research task. Most existing detection methods fail to adequately extract the global features of targets, leading to suboptimal performance on non-salient objects. In this paper, we propose a unified framework called Higher efficient (He)-YOLOv7 to enhance the detection capability of YOLOv7 for non-salient objects. Firstly, we introduce a refined Squeeze and Excitation Network (SENet) to dynamically adjust the weights of feature channels, thereby enhancing the model's perception of non-salient objects. Secondly, we design an Angle Intersection over Union (AIoU) loss function that considers relative positional information, improving on the widely used Complete Intersection over Union (CIoU) loss function in YOLOv7 and significantly accelerating the model's convergence. Moreover, He-YOLOv7 adopts a blended data augmentation strategy to simulate occlusion among objects, further improving the model's ability to filter out noise and enhancing its robustness. Experimental comparisons show an improvement of 2.4% mean Average Precision (mAP) on the Microsoft Common Objects in Context (MS COCO) dataset and of 1.2% mAP on the PASCAL VOC dataset. Our approach also performs comparably to state-of-the-art real-time object detection methods.
Pages: 42257-42283
Page count: 26
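
For readers unfamiliar with the baseline that the abstract says AIoU improves on, the following is a minimal sketch of the standard CIoU loss for a single pair of axis-aligned boxes. It is not the paper's AIoU loss, whose definition does not appear in this record, and it is not taken from the YOLOv7 code base. The function name `ciou_loss`, the `(x1, y1, x2, y2)` box layout, and the `eps` stabiliser are assumptions made for illustration.

```python
import math


def ciou_loss(box, gt, eps=1e-9):
    """Standard CIoU loss for two boxes given as (x1, y1, x2, y2).

    Illustrative sketch only: hypothetical helper, not the paper's AIoU.
    """
    x1, y1, x2, y2 = box
    gx1, gy1, gx2, gy2 = gt
    w, h = x2 - x1, y2 - y1
    gw, gh = gx2 - gx1, gy2 - gy1

    # Overlap term: plain intersection over union.
    inter_w = max(0.0, min(x2, gx2) - max(x1, gx1))
    inter_h = max(0.0, min(y2, gy2) - max(y1, gy1))
    inter = inter_w * inter_h
    union = w * h + gw * gh - inter
    iou = inter / (union + eps)

    # Distance term: squared centre distance divided by the squared
    # diagonal of the smallest box enclosing both boxes.
    rho2 = ((x1 + x2) / 2 - (gx1 + gx2) / 2) ** 2 \
         + ((y1 + y2) / 2 - (gy1 + gy2) / 2) ** 2
    enc_w = max(x2, gx2) - min(x1, gx1)
    enc_h = max(y2, gy2) - min(y1, gy1)
    c2 = enc_w ** 2 + enc_h ** 2 + eps

    # Aspect-ratio term: penalises mismatch between width/height ratios.
    v = (4 / math.pi ** 2) * (
        math.atan(gw / (gh + eps)) - math.atan(w / (h + eps))
    ) ** 2
    alpha = v / (1 - iou + v + eps)

    return 1 - iou + rho2 / c2 + alpha * v


if __name__ == "__main__":
    print(ciou_loss((0, 0, 2, 2), (0, 0, 2, 2)))  # identical boxes
    print(ciou_loss((0, 0, 1, 1), (2, 2, 3, 3)))  # disjoint boxes
```

The loss is close to zero for identical boxes and exceeds 1 for disjoint boxes, because the centre-distance term still contributes when the IoU is zero.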