Attention Mechanism and Detection Box Information Based Real-time Multi-Object Vehicle Detection

被引:0
|
作者
Wu H. [1 ,2 ,3 ,4 ]
Wu W. [5 ]
Sun X. [2 ]
Zhong J. [1 ]
Cao F. [1 ]
机构
[1] School of Computer Science and Technology, Hefei Normal University, Hefei
[2] School of Information and Control Engineering, China University of Mining and Technology, Xuzhou
[3] Universities Joint Key Laboratory of Photoelectric Detection Science and Technology in Anhui Province, Hefei
[4] Key Laboratory of Philosophy and Social Science of Anhui Province on Adolescent Mental Health and Crisis Intelligence Intervention, Hefei
[5] School of Economics and Trade, Anhui Business and Technology College, Hefei
关键词
AIoU Loss; attention mechanism; CS-NMS; multi-object detec-tion; YOLOv5s;
D O I
10.20532/cit.2022.1005718
中图分类号
学科分类号
摘要
Ensuring both the accuracy of vehicle target detection and meeting real-time requirements is crucial in traffic videos. The YOLOv5s target detection frame-work, known for its accuracy and efficiency, has at-tracted attention in academic circles. However, there are still some features that can be optimized. First of all, the detection subnet of the YOLOv5s framework cannot smoothly convert complex feature maps into relatively sparse target prediction boxes. To solve this, we integrate a self-attention-based gating mechanism into the detection subnet, forming the YOLOv5s-SAG network. Secondly, the loss function of CIoU used by YOLOv5s pays insufficient attention to the overlap-ping area of the detection frame, which can be used as metric for measuring target detection effectiveness. We add the loss term of area ratio to CIoU to further improve the modeling ability. Finally, the current multi-class Non-Maximum Suppression algorithm can cause high overlap of multi-class detection frames. To improve it, we propose a multi-class CS-NMS algorithm based on category suppression. Experimental results show an approximately 8% improvement in the mAP50 index on the UA-DETRAC dataset compared with YOLOv5s. The proposed algorithm also achieves better detection results compared to mainstream target detection algorithms and meets the real-time requirements of traffic video analysis. © 2022, University of Zagreb Faculty of Electrical Engineering and Computing. All rights reserved.
引用
收藏
页码:239 / 256
页数:17
相关论文
共 50 条
  • [1] Real-time multi-object detection model for cracks and deformations based on deep learning
    Xu, Gang
    Yue, Qingrui
    Liu, Xiaogang
    ADVANCED ENGINEERING INFORMATICS, 2024, 61
  • [2] Multi-object Detection Based on Deep Learning in Real Classrooms
    Shao, Benchi
    Jiang, Fei
    Shen, Ruimin
    PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2018, 11013 : 352 - 359
  • [3] Region Boosting for Real-Time Object Detection Using Multi-Dimensional Attention
    Chen, Jinlong
    Xu, Kejian
    Ning, Yi
    Xu, Zhi
    IEEE ACCESS, 2024, 12 : 171634 - 171643
  • [4] MULTI-OBJECT TRACKING AS ATTENTION MECHANISM
    Fukui, Hiroshi
    Miyagawa, Taiki
    Morishita, Yusuke
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 505 - 509
  • [5] Real-time realization of infrared ship detection with an attention mechanism
    Zeng, NZ
    Wang, YH
    Zhang, TX
    MULTISPECTRAL AND HYPERSPECTRAL IMAGE ACQUISITION AND PROCESSING, 2001, 4548 : 269 - 274
  • [6] Real-Time Lane Detection by Using Biologically Inspired Attention Mechanism to Learn Contextual Information
    Zhang, Lu
    Jiang, Fengling
    Kong, Bin
    Yang, Jing
    Wang, Can
    COGNITIVE COMPUTATION, 2021, 13 (05) : 1333 - 1344
  • [7] Real-Time Lane Detection by Using Biologically Inspired Attention Mechanism to Learn Contextual Information
    Lu Zhang
    Fengling Jiang
    Bin Kong
    Jing Yang
    Can Wang
    Cognitive Computation, 2021, 13 : 1333 - 1344
  • [8] SMALL OBJECT DETECTION ALGORITHM BASED ON CONTEXT INFORMATION AND ATTENTION MECHANISM
    Zhong Hang
    Li Fan
    Kuang Ping
    Gu Xiaofeng
    He Mingyun
    Tang Heng
    2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,
  • [9] Multi-Modal Attention Guided Real-Time Lane Detection
    Zhang, Xinyu
    Gong, Yan
    Li, Zhiwei
    Liu, Xuan
    Pan, Shuyue
    Li, Jun
    2021 6TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2021), 2021, : 146 - 153
  • [10] Composite Backbone Small Object Detection Based on Context and Multi-Scale Information with Attention Mechanism
    Jing, Xinhan
    Liu, Xuesong
    Liu, Baolin
    MATHEMATICS, 2024, 12 (05)