Target detection algorithm based on multilayer attention mechanism-adaptive feature fusion network

被引:3
作者
An, Fengping [1 ]
Wang, Jianrong [2 ,3 ]
机构
[1] Huaiyin Normal Univ, Sch Phys & Elect Elect, Engn, Huaian 223300, Peoples R China
[2] Shanxi Univ, Sch Math Sci, Tianjin 030006, SX, Peoples R China
[3] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300072, Peoples R China
基金
中国博士后科学基金;
关键词
Target Detection; Multi-layer Attention Mechanism; Adaptive Feature Fusion Network; Node Attention; Feature Reinforcement; OBJECT DETECTION;
D O I
10.1007/s13042-023-01791-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Target images are complex and diverse under the influence of scale, occlusion and appearance factors in real scenes, and they affect the performance of target detection algorithms. They also make the existing target detection algorithms suffer from the following problems. On the one hand, the neurons in the target detection algorithm architecture cannot learn the complex interaction and semantic features inside the target image. On the other hand, the feature expression of different target images is insufficient and the channel reduction leads to the loss of position information and other problems. herefore, a multi-layer attention mechanism of considering both node and semantic level attention in the model architecture is proposed. In this method, the fusion of neighbors and semantic information is weighted, and node representations is learned under a hierarchical aggregation manner.Just because of this, it can improve the effectiveness and interpretability of the model, and solve the problem of complex interaction and rich semantic feature acquisition within images. Furthermore, we propose an adaptive feature fusion network which can adaptively filter the useless information of other layers and retain the feature information that is beneficial to target recognition. A feature enhancement module, is introduced to enhance the identifiability of the top-level target features of the feature network, and which can alleviate the problem of loss of target position. Finally, the extensive tests using PASCAL VOC and MSCOCO datasets, the experimental result shows that our method not only has the best recognition performance, but also has better stability and robustness.
引用
收藏
页码:2685 / 2695
页数:11
相关论文
共 43 条
  • [1] Measuring the Objectness of Image Windows
    Alexe, Bogdan
    Deselaers, Thomas
    Ferrari, Vittorio
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) : 2189 - 2202
  • [2] Object recognition algorithm based on optimized nonlinear activation function-global convolutional neural network
    An, Feng-Ping
    Liu, Jun-e
    Bai, Lei
    [J]. VISUAL COMPUTER, 2022, 38 (02) : 541 - 553
  • [3] Ghost Target Detection in 3D Radar Data using Point Cloud based Deep Neural Network
    Chamseddine, Mahdi
    Rambach, Jason
    Stricker, Didier
    Wasenmueller, Oliver
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 10398 - 10403
  • [4] False-Alarm-Controllable Radar Detection for Marine Target Based on Multi Features Fusion via CNNs
    Chen, Xiaolong
    Su, Ningyuan
    Huang, Yong
    Guan, Jian
    [J]. IEEE SENSORS JOURNAL, 2021, 21 (07) : 9099 - 9111
  • [5] Dynamic DETR: End-to-End Object Detection with Dynamic Attention
    Dai, Xiyang
    Chen, Yinpeng
    Yang, Jianwei
    Zhang, Pengchuan
    Yuan, Lu
    Zhang, Lei
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2968 - 2977
  • [6] Deshpande A, 2021, NEUROSN INFORM, V1
  • [7] The Pascal Visual Object Classes (VOC) Challenge
    Everingham, Mark
    Van Gool, Luc
    Williams, Christopher K. I.
    Winn, John
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) : 303 - 338
  • [8] Few-Shot Object Detection with Attention-RPN and Multi-Relation Detector
    Fan, Qi
    Zhuo, Wei
    Tang, Chi-Keung
    Tai, Yu-Wing
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4012 - 4021
  • [9] Rich feature hierarchies for accurate object detection and semantic segmentation
    Girshick, Ross
    Donahue, Jeff
    Darrell, Trevor
    Malik, Jitendra
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587
  • [10] Jais I. K. M., 2019, Knowl. Eng. Data Sci., V2, P41, DOI DOI 10.17977/UM018V2I12019P41-46