Attention Mechanism and Detection Box Information Based Real-time Multi-Object Vehicle Detection

被引:0
|
作者
Wu H. [1 ,2 ,3 ,4 ]
Wu W. [5 ]
Sun X. [2 ]
Zhong J. [1 ]
Cao F. [1 ]
机构
[1] School of Computer Science and Technology, Hefei Normal University, Hefei
[2] School of Information and Control Engineering, China University of Mining and Technology, Xuzhou
[3] Universities Joint Key Laboratory of Photoelectric Detection Science and Technology in Anhui Province, Hefei
[4] Key Laboratory of Philosophy and Social Science of Anhui Province on Adolescent Mental Health and Crisis Intelligence Intervention, Hefei
[5] School of Economics and Trade, Anhui Business and Technology College, Hefei
关键词
AIoU Loss; attention mechanism; CS-NMS; multi-object detec-tion; YOLOv5s;
D O I
10.20532/cit.2022.1005718
中图分类号
学科分类号
摘要
Ensuring both the accuracy of vehicle target detection and meeting real-time requirements is crucial in traffic videos. The YOLOv5s target detection frame-work, known for its accuracy and efficiency, has at-tracted attention in academic circles. However, there are still some features that can be optimized. First of all, the detection subnet of the YOLOv5s framework cannot smoothly convert complex feature maps into relatively sparse target prediction boxes. To solve this, we integrate a self-attention-based gating mechanism into the detection subnet, forming the YOLOv5s-SAG network. Secondly, the loss function of CIoU used by YOLOv5s pays insufficient attention to the overlap-ping area of the detection frame, which can be used as metric for measuring target detection effectiveness. We add the loss term of area ratio to CIoU to further improve the modeling ability. Finally, the current multi-class Non-Maximum Suppression algorithm can cause high overlap of multi-class detection frames. To improve it, we propose a multi-class CS-NMS algorithm based on category suppression. Experimental results show an approximately 8% improvement in the mAP50 index on the UA-DETRAC dataset compared with YOLOv5s. The proposed algorithm also achieves better detection results compared to mainstream target detection algorithms and meets the real-time requirements of traffic video analysis. © 2022, University of Zagreb Faculty of Electrical Engineering and Computing. All rights reserved.
引用
收藏
页码:239 / 256
页数:17
相关论文
共 50 条
  • [11] Object Detection Model Based on Attention Mechanism
    Han, Mengxue
    Tang, Xiangyan
    Yang, Yue
    Huang, Zhennan
    BIG DATA AND SECURITY, ICBDS 2023, PT I, 2024, 2099 : 74 - 88
  • [12] AG-YOLO: Attention-guided network for real-time object detection
    Hangyu Zhu
    Libo Sun
    Wenhu Qin
    Feng Tian
    Multimedia Tools and Applications, 2024, 83 : 28197 - 28213
  • [13] AG-YOLO: Attention-guided network for real-time object detection
    Zhu, Hangyu
    Sun, Libo
    Qin, Wenhu
    Tian, Feng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 28197 - 28213
  • [14] Multi-scale vehicle and pedestrian detection algorithm based on attention mechanism
    Li J.-Y.
    Yang J.
    Kong B.
    Wang C.
    Zhang L.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2021, 29 (06): : 1448 - 1458
  • [15] Infrared small object detection based on attention mechanism
    Li, Junyu
    Liu, Qiankun
    Fu, Ying
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2024, 45 (14):
  • [16] A Distance-Based Attention Mechanism for Object Detection
    Bai, Xiao
    Liang, Chengzhi
    Zhou, Jianqun
    2022 11TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS (ICCCAS 2022), 2022, : 246 - 250
  • [17] Object detection in real-time video surveillance using attention based transformer-YOLOv8 model
    Nimma, Divya
    Al-Omari, Omaia
    Pradhan, Rahul
    Ulmas, Zoirov
    Krishna, R. V. V.
    El-Ebiary, Ts. Yousef A. Baker
    Rao, Vuda Sreenivasa
    ALEXANDRIA ENGINEERING JOURNAL, 2025, 118 : 482 - 495
  • [18] Real-time dense small object detection algorithm based on multi-modal tea shoots
    Shuai, Luyu
    Chen, Ziao
    Li, Zhiyong
    Li, Hongdan
    Zhang, Boda
    Wang, Yuchao
    Mu, Jiong
    FRONTIERS IN PLANT SCIENCE, 2023, 14
  • [19] Recurrent Cross Attention Mechanism for Improved Robustness and Real-Time Performance in Lane Detection
    Huang, Bingqiang
    Wang, Shiqian
    Fei, Zhengshun
    Xiang, Xinjian
    Tian, Shaohua
    Tang, Fuying
    Yuan, Tianshun
    IEEE ACCESS, 2024, 12 : 126376 - 126388
  • [20] Multi-orientation Saliency Features Fusion Based Multi-object Detection
    Lu, Hong
    Tang, Hao
    Fei, Shumin
    Cao, Weifeng
    EIGHTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2016), 2016, 10033