Residual attention mechanism and weighted feature fusion for multi-scale object detection

Authors
Jie Zhang
Qiye Qi
Huanlong Zhang
Qifan Du
Fengxian Wang
Xiaoping Shi
Affiliations
[1] Zhengzhou University of Light Industry, College of Electrical and Information Engineering
[2] Harbin Institute of Technology
Keywords
Deep learning; Object detection; Residual attention mechanism; Weighted feature fusion
Abstract
Object detection is one of the critical problems in computer vision research and an essential basis for understanding the high-level semantic information of images. To improve object detection performance, this article proposes an improved YOLOv3 multi-scale object detection method. First, a residual attention module, consisting of a channel attention module, a spatial attention module, and a skip connection, is introduced into the neck of YOLOv3. The module is applied to the three feature layers obtained from the backbone so that the output features focus on the channels and regions related to the object. Second, in the top-down feature fusion stage of YOLOv3, an additional weight is assigned to each input feature; the size of each weight is determined by how much that input feature contributes to the output features. Experimental results on the KITTI, PASCAL VOC, and bird's nest datasets verify the effectiveness of the proposed method in object detection. The proposed method has significant value in electric power inspection and self-driving automobiles.
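The two ideas in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract gives no formulas, so the attention layout (global average pooling plus a small two-layer MLP for channels, a parameter-free pooled map for space) and the weight normalization (clip to non-negative, then divide by the sum) are assumptions chosen for illustration, and all function and parameter names (`channel_attention`, `spatial_attention`, `residual_attention`, `weighted_fusion`, `w1`, `w2`, `eps`) are hypothetical.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def channel_attention(feat, w1, w2):
    """Per-channel gate in (0, 1). feat has shape (C, H, W).

    Assumed design: global average pooling followed by a two-layer MLP.
    """
    squeezed = feat.mean(axis=(1, 2))          # (C,)
    hidden = np.maximum(w1 @ squeezed, 0.0)    # ReLU, shape (C_reduced,)
    return sigmoid(w2 @ hidden)[:, None, None]  # (C, 1, 1)


def spatial_attention(feat):
    """Per-location gate in (0, 1), shape (1, H, W).

    Assumed stand-in: sigmoid of channel-wise mean plus max. A trained
    module would normally use a learned convolution here.
    """
    pooled = feat.mean(axis=0) + feat.max(axis=0)
    return sigmoid(pooled)[None, :, :]


def residual_attention(feat, w1, w2):
    """Channel gate, then spatial gate, then a skip connection."""
    refined = feat * channel_attention(feat, w1, w2)
    refined = refined * spatial_attention(refined)
    return feat + refined  # skip connection keeps the original features


def weighted_fusion(features, raw_weights, eps=1e-4):
    """Fuse same-shaped feature maps with normalized, non-negative weights.

    Assumed normalization: clip weights at zero and divide by their sum.
    """
    w = np.maximum(np.asarray(raw_weights, dtype=float), 0.0)
    w = w / (w.sum() + eps)
    return sum(wi * f for wi, f in zip(w, features))
```

In a trained network, `w1`, `w2`, and `raw_weights` would be learned parameters, and the inputs to `weighted_fusion` would first be resized to a common shape, as in the top-down path described above.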
Pages: 40873-40889
Page count: 16