Target detection algorithm based on multilayer attention mechanism-adaptive feature fusion network

Cited by: 3

Authors:

An, Fengping ^{[1]}

Wang, Jianrong ^{[2,3]}

Affiliations:

[1] Huaiyin Normal Univ, Sch Phys & Elect Elect Engn, Huaian 223300, Peoples R China

[2] Shanxi Univ, Sch Math Sci, Taiyuan 030006, SX, Peoples R China

[3] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300072, Peoples R China

Source:

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS | 2023 / Vol. 14 / No. 8

Funding:

China Postdoctoral Science Foundation;

Keywords:

Target Detection; Multi-layer Attention Mechanism; Adaptive Feature Fusion Network; Node Attention; Feature Reinforcement; OBJECT DETECTION;

DOI:

10.1007/s13042-023-01791-z

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In real scenes, target images are complex and diverse because of variations in scale, occlusion and appearance, which degrades the performance of target detection algorithms. Existing target detection algorithms therefore suffer from two problems. On the one hand, the neurons in the detection architecture cannot learn the complex interactions and semantic features inside the target image. On the other hand, the feature representation of different target images is insufficient, and channel reduction leads to the loss of position information. Therefore, a multi-layer attention mechanism that considers both node-level and semantic-level attention in the model architecture is proposed. In this method, the fusion of neighbor and semantic information is weighted, and node representations are learned through hierarchical aggregation. This improves the effectiveness and interpretability of the model and addresses the problem of capturing complex interactions and rich semantic features within images. Furthermore, we propose an adaptive feature fusion network that adaptively filters out useless information from other layers and retains the feature information that is beneficial to target recognition. A feature enhancement module is introduced to enhance the identifiability of the top-level target features of the feature network, which alleviates the loss of target position information. Finally, extensive tests on the PASCAL VOC and MSCOCO datasets show that our method not only achieves the best recognition performance but also has better stability and robustness.
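The abstract gives no equations for the adaptive feature fusion network, so the following is only a rough illustration of the general idea of adaptively weighting features from several layers, not the authors' method. It assumes (hypothetically) that all feature maps have already been resized to a common shape and that per-position fusion weights come from a softmax over per-layer scores; the names `adaptive_fuse`, `features` and `weight_logits` are invented for this sketch.

```python
import numpy as np

def softmax(x, axis=0):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)

def adaptive_fuse(features, weight_logits):
    """Fuse L feature maps with per-position softmax weights.

    features:      array of shape (L, C, H, W), one map per layer,
                   assumed already resized to a common (H, W).
    weight_logits: array of shape (L, H, W), unnormalized scores
                   (in a real network these would be learned).
    Returns the fused map (C, H, W) and the weights (L, H, W).
    """
    features = np.asarray(features, dtype=float)
    weights = softmax(np.asarray(weight_logits, dtype=float), axis=0)
    # Broadcast the weights over the channel axis and sum over layers.
    fused = (weights[:, None, :, :] * features).sum(axis=0)
    return fused, weights

rng = np.random.default_rng(0)
feats = rng.normal(size=(3, 4, 8, 8))   # 3 layers, 4 channels, 8x8 maps
logits = rng.normal(size=(3, 8, 8))     # stand-in for learned scores
fused, w = adaptive_fuse(feats, logits)
```

Because the weights at each spatial position sum to one, a layer whose score is much lower than the others contributes almost nothing there, which is one simple way to read "filtering useless information from other layers".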

Citation

Pages: 2685-2695

Number of pages: 11
